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ABSTRACT 

The  purpose  of  this  -sTtrd-y  was  to  develop  a  computer 
algorithm  to  segment  letters  in  a  word  so  that  pattern 
recognition  techniques  such  as  two  dimensional  Fourier 
Transforms  could  be  used  to  recognize  the  individual 
letters.  This  study  did  not  intend  to  cover  various  pattern 
recognition  techniques  but,  as  the  algorithm  developed 
letter  features  were  recognized  to  gather  information  or 
clues  on  adjacent  touching  letters  to  decide  on  possible 
segmentation  locations.  This  study  was  limited  to  the 
segmentation  of  lower  case  letters  in  the  English  alphabet 
excluding  stylized  print  and  italic  print.  Lower  case  print 
was  chosen  because  it  represents  the  worst  case  task  for  a 
segmentation  algorithm.  Two  adjacent  lower  case  letters 
often  look  like  a  third  lower  case  letter.  However,  in 
upper  case  letters  similiar  occurrences  are  rare.  Hand 
printed  touching  letters  were  selected  to  demonstrate  the 
validity  of  the  algorithm.  ‘  •  /, 


INTRODUCTION 


1 . I  BACKGROUND 

Since  the  advent  of  high  speed  computers,  users  have 
had  to  enter  hardcopy  text  into  data  files  by  physically 
retyping  the  text  on  a  keyboard.  For  secretaries  and  other 
computer  users,  this  retyping  requires  many  hours  of  tedious 
manual  labor  at  a  time  when  labor  is  becoming  more  expensive 
while  the  cost  of  electronically  using  and  storing  that  data 
is  rapidly  declining  (  7  :  105  ).  The  ideal  solution  would 
be  a  computerized  reading  machine  or  optical  character 
reader  (  OCR  )  which  electronically  scans  and  digitizes  text 
and  converts  the  text  into  ASCII  data  to  be  stored  on 
magnetic  disk  or  manipulated  by  computer  software. 

OCR's  have  many  applications  besides  just  text  storage. 
The  U.S.  Post  Office  has  been  trying  to  find  a  fast  and 
reliable  reading  machine  for  sorting  mail  since  the  early 
1960’s.  The  Foreign  Technology  Division,  at  Wright 
Patterson  A.F.B.,  Ohio,  has  been  using  a  slow,  error  prone 
reading  machine  as  a  front  end  processor  for  computer 
translation  of  Russian  text  (  A  ).  A  reading  machine  could 
also  be  used  to  preserve  this  nation's  libraries  of  books. 
Presently,  air  pollution  is  deteriorating  all  books  made  of 
wood  pulp.  It  would  be  impossible  to  reprint  all  these 
books  and  many  would  be  lost  for  future  generations  if  the 
technology  to  store  or  preserve  them  is  not  developed. 
Using  present  technology,  it  is  already  possible  to  store 
digitized  pictures  of  each  page  on  high  density  storage 


disks  but,  the  use  of  a  reading  machine  using  an  entropy 
efficient  code  would  greatly  reduce  the  storage  space  and 
cost  enabling  entire  libraries  to  be  stored  on  a  few  disks. 
By  reducing  the  data  bits  needed  to  store  a  book,  this 
technology  would  make  it  feasible  to  quickly  call  up  any 
book  desired  on  a  home  computer  without  ever  going  to  a 
library.  Another  application  of  OCR  machines  is  aid  to  the 
blind  (  7  110  ).  Kurzweil  Computer  Products  introduced  the 
Reading  Machine  in  1976  which  scans  text  and  reads  it  aloud. 
Although  the  error  rate  may  be  intolerable  for  storing  text, 
the  human  listener  can  still  understand  letter  errors  and 
even  whole  word  errors  due  to  the  context  of  the  words  and 
sentences,  respectively  (  4  ).  More  will  be  said  later  in 
the  text  about  the  Kurzweil  OCR  machines. 

1.2  PROBLEM 

There  are  already  many  reading  machines  on  the  market. 
However,  most  of  them  can  only  read  special  typefaces  such 
as  MICR  (  Magnetic  Ink  Character  Recognition  )  type,  OCR-B, 
or  OCR-A.  These  types  have  additional  features  which  help 
the  reading  machine  distinquish  similiar  characters.  There 
are  also  several  machines  designed  to  handle  large 
quantities  of  type  written  text.  These  machines,  including 
models  made  by  companies  such  as  DEST,  TOTEC,  and  Hendrix, 
read  by  doing  template  matching  of  known  fonts  and  rely  on 
the  uniform  width  and  spacing  of  type  written  text.  Only 


the  Kurzweil  4000  machine  uses  an  Artificial  Intelligence 


variable  width  fonts  found  in  books,  magazines,  and  papers. 
However,  the  Kurzweil  machine  is  only  used  for  large 
quantities  of  the  same  type  of  text  because  the  machine  has 
to  be  "taught"  and  produces  intolerable  errors  until  the 
system  has  "learned." 

The  reading  machine  problem  is  difficult.  Most 
machines  have  no  problem  reading  the  uniformly  spaced  text 
typed  neatly  on  high  quality  white  paper.  In  fact,  most 
template  matching  reading  machines  only  need  half  of  the 
template's  digital  information  to  correctly  recognize  a 
character.  The  problem  occurs  when  the  letters  in  the  text 
have  nonuniform  width  and  uneven  spacing.  The  letters  are 
also  often  overlapping,  or  touching  each  other.  These 
variable  width  or  proportionally  spaced  characters,  as  they 
are  often  called,  confuse  most  reading  machines.  The 
width  of  letters  can  vary  in  range  from  one  to  four  units  of 
length.  In  most  books,  the  letter  "m"  is  four  times  as  wide 
as  the  letter  "i".  Even  proportional  print  characters  could 
be  read  if  the  template  of  each  letter  was  known  before 
hand.  But,  most  text  is  printed  in  varying  fonts  and  in 
varying  pitch.  In  these  cases,  template  matching  fails  and 
a  more  sophisticated  approach  has  to  be  used  (  7  :  106-108  ) 

Noise  is  another  dominant  factor  that  makes  the  reading 
machine  problem  difficult.  Type  written  characters  are 
printed  very  neatly  with  adjacent  characters  separated  from 
each  other.  A  window  can  be  drawn  around  each  typed 


character  without  including  adjacent  characters.  In 
contrast,  characters  in  books  and  newspapers  are  often 
touching  or  overlapping  and  a  window  cannot  be  draw  around 
each  character  without  including  some  of  the  adjacent 
characters.  In  addition,  breaks  in  letters  or  dark  spots 
near  letters  occur  due  to  insufficient  ink  or  poor  printing 
plates.  Also,  threshold  effects  and  noise  produce  random 
white  and  black  spots  in  the  image  when  the  text  is 
digitized  for  computer  processing.  All  these  types  of  noise 
not  only  interfere  with  attempts  to  recognize  the  letters 
but  intefere  with  attempts  to  separate  or  segment  the 
individual  letters  from  each  other. 

1 . 3  SCOPE 

The  goal  of  this  study  was  to  develop  a  computer 
algorithm  to  segment  letters  in  a  word  so  that  pattern 
recognition  techiniques  using  either  Fourier  Transform  or 
feature  recognition  could  recognize  the  separate  individual 
letters.  This  study  did  not  intend  to  cover  various  pattern 
recognition  techniques,  but  as  the  algorithm  evolved  feature 
recognition  was  used  to  gather  information  or  clues  on 
neighboring  letters  to  decide  on  possible  segmentation 
locations.  This  study  was  limited  to  the  segmentation  of 
lower  case  English  alphabet  excluding  stylized  print  and 
italic  characters.  Lower  case  print  was  chosen  because  it 
represents  a  more  difficult  problem  for  a  segmention 
algorithm.  For  example,  in  upper  case  print,  two  adjacent 


capital  letters  rarely  look,  like  a  third  capital  letter 


I 
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except  possibly  in  the  case  of  El  looking  like  B  or  VV 
looking  like  a  W.  However,  in  lower  case  print  these 
problems  and  others  are  quite  common  and  will  be  discussed 
later  in  the  Theoretical  Development.  Handprinted, 
touching,  lower  case  letters  were  selected  in  an  attempt  to 
create  a  worst  case  task  for  algorithm  testing.  Hand 
printed  letter  were  also  used  to  simulate  the  roughness  of 
edges  and  the  filling  in  of  gaps  due  to  the  effects  of 
digitization  and  noise. 

1.4  Assumptions 

The  following  list  of  assumptions  were  made  because  of 
time  constraints  in  order  to  spend  maximum  time  on  the 
segmentation  problem.  Assumptions  3  and  4  were  assumed  to 
exist  for  all  fonts  after  examining  2  2  standard  type  faces 
(  2  :  189-215  ) . 

1.  Noise  filtering  to  remove  isolated  dark  spots  and 
appropriate  thresholding  has  already  been  performed  to 
produce  a  digitized  binary  (black  and  white)  image  of  each 
word. 

2.  The  probability  that  random  noise  occurring  in 
isolated  areas  to  cause  one  letter  to  look  like  another  is 
neg 1 i g i b 1 e  . 

3.  The  width  of  vertical  lines  in  printed  fonts  is 
approximately  constant. 

4  . 
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The  height  of  small  letters  such  as  a,e,c,o,  etc. 


is  approximately  constant  in  printed  fonts. 


6 


( 


5.  The  lines  of  printed  text  will  be  placed 
horizontally  and  the  letters  will  be  place  right  side  up. 

6.  Vertical  lines  in  printed  fonts  are 

approximately  perpendicular  to  the  horizontal  line  of  print. 

1 . 5  APPROACH 

Research  proceeded  from  the  idea  that,  except  for  six 
letters,  English  printed  letters  have  a  at  least  one  region 
of  vertical  continuous  lines  at  the  edge  of  each  letter. 
The  exceptions  are  s,v,w,x,y,  and  z.  Of  these  six 

exceptions,  the  last  five  occur  with  low  probability. 
Taking  advantage  of  this  characteristic,  letter  features 
such  as  vertical  continuous  lines  were  used  as  the  primary 
segmentation  clues  to  predict  where  one  letter  ends  and 
another  begins.  Additional  letter  features  such  as  height, 
holes,  valleys,  and  arches  were  also  detected  to  get  clues 
as  to  what  letters  were  present  on  either  side  of  the 
vertical  continuous  line.  These  segmentation  clues  along 
with  any  detected  spaces  in  a  word  were  used  as  inputs  to  a 
final  segmentation  decision  algorithm. 

1.6  EQUIPMENT  AND  MATERIAL 

The  equipment  and  material  support  was  provided  by  the 
Signal  Processing  Lab  at  the  Air  Force  Institute  of 
Technology  (AFIT).  Most  of  the  work  was  performed  using  the 
Data  General  Nova  2  minicomputer,  the  Data  General  Eclipse 
S/250  minicomputer,  and  the  Octek  2000  Image  Analyzer. 
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ii.  191951119*9  2I¥§99£9I9I 

2.1  CHOOSING  THE  APPROACH 

The  segmentation  of  English  printed  letters  is  a 
problem  that  most  papers  and  articles  describing  reading 
machines  and  character  recognition  do  not  adequatedly 

address.  Three  theses  by  Bentkowski  (  1  ),  Shun  (  5  )  and 

Simmons  (  6  )  at  AFIT  have  tried  to  tackle  this  problem. 

All  three  tried  to  correlate  the  FT  of  the  image  inside  a 
sliding  window  with  the  FT  of  a  letter  template  being 
searched.  However,  in  all  cases,  using  upper  case  letters, 
correlation  peaks  occurred  on  letters  that  did  not  resemble 
the  search  template.  The  sliding  window  approach  has 
several  potential  problems  especially  with  lower  case 
letters.  Picking  a  window  size  could  be  a  problem  with 
letters  varying  in  width  by  as  much  as  4  to  1  units  of 
length.  False  correlations  could  occur  because  two  adjacent 
letters  often  look  like  a  third  letter.  For  example,  "o" 
and  "i»  might  look  like  a  "d".  False  correlations  could 
also  occur  because  shapes  of  one  letter  often  exist  in  other 
letters.  For  example,  the  shape  of  " n "  exist  in  "h"  and 

"m".  Also,  the  shape  of  "o"  exist  in  " b "  ,  " d "  ,  " p "  ,  and  "q". 

Figure  2.  below  shows  other  cases  where  letter  shapes  exist 
in  more  than  one  letter.  For  these  reasons,  the  sliding 
window  correlation  approach  could  only  work  if  extensive 
error  correction  postprocessing  and  "intelligence"  is  added. 


m 


o  b  e  d 

b  d  k  h 

o  b  d  p 

h  a 

h  a  n 


Figure  2.1  Letter  Shape*  Which  Occur  in  Other  Letters 


An  algoritha  to  segaent  English  letters  should  take 
full  advantage  of  the  characteristics  inherent  in  the  shape 
of  the  letters.  This  will  not  only  enable  the  algoritha  to 
be  simpler  but  enable  it  to  run  quicker.  At  present  the 
Kurzweil  4000,  the  only  machine  that  can  read  the 
proportional  print  in  books,  requires  a  ainicoaputer  and 
megabytes  of  memory.  Although,  it  is  proprietary 
information,  the  segmentation  algorithm  probably  takes  up  a 
large  percentage  of  the  memory.  The  segmentation  problem  is 
complex.  It  is  almost  necessary  to  know  what  the  individual 
letters  are  in  a  word  to  decide  where  to  segment  them.  Take 
for  example  the  case  of  a  "on”  letter  combination.  See 
Figure  2.2  below.  Depending  on  where  this  combination  is 
segmented,  the  machine  might  read  a  "cn",  "on",  "on",  or 
even  "cin"  if  noise  adds  a  dot  above  the  vertical  line. 
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cn  an  on  an 


Figure  2.2  Example  of  Ambiguous  Letter  Combination 

2.2  LETTER  FEATURES  AS  SEGMENTATION  CLUES 

As  mentioned  in  the  "Approach",  letter  features  such 
as  vertical  continuous  lines  (  VCL  )  were  used  as  the 
primary  segmentation  clues  from  which  other  segmentation 
clues  on  either  side  of  the  VCL  were  examined.  VCL  is 
defined  as  the  greatest  number  of  black  pixels  adjacent  to 
other  black  pixels  along  a  vertical  line.  The  detection  of 
VCL  regions  was  chosen  because  of  inherent  noise  immunity. 
The  probability  that  random  black  pixels  will  line  up  in  a 
vertical  line  is  negligible.  VCL  regions  occur  along 
vertical  lines  in  a  letter  and  at  the  edge  of  curves  due  to 
the  "squaring"  effect  of  digitization  and  the  thickness  of 
the  curved  line.  Figure  2.3  below  shows  the  VCL's  detected 


II 


in  the  word 


the 


.'Tv. 


For  the  99  cases  where  VCL's  do  not  exist  at  the 
segmentation  point,  a  change  from  one  region  to  another  was 
used  as  a  segmentation  clue.  For  example,  a  change  from  an 
O-region  to  a  U-region  is  used  as  a  segmentation  clue.  This 
segmentation  rule  will  handle  all  but  twelve  cases  (  ea, 
es,  ez,  vv,  vw,  wv ,  ww ,  yy,  wy,  yw ,  vy,  yv,  yy  ). 
Additional  segmentation  rules  can  handle  eight  of  these 
twelve  cases.  The  first  three  of  these  twelve  cases  is 
segmented  at  the  middle  of  the  letter  combination  since  the 
width  of  the  8-region  is  too  large  to  be  caused  by  a  single 
letter  and  e,a,s,  and  z  are  approximately  the  same  width  in 
all  fonts.  (  Examples  of  common  printed  fonts  are  shown  in 
Appendix  B.  )  The  last  five  cases  (  wy ,  yw,  vy,  yv,  yy  ) 
are  segmented  between  the  U-regions  by  noting  there  is  no 
letter  in  the  English  alphabet  which  has  two  U-regions  and 
which  extends  below  the  line  of  print  such  as  the  letter 
"y".  The  last  four  cases  (  vv ,  ww ,  wv ,  vw  )  can  only  be 
d i s t i nqu i s hed  if  the  context  of  the  letters  or  the  type  of 
font  used  is  known.  If  a  gap  or  space  does  not  exist 
between  these  letter  combinations  not  even  a  human  reader 
would  be  able  to  distinguish  these  letter  combinations  from 
false  letters  or  letter  combinations.  (  See  Figure  2.5  ) 
Postprocessing  of  the  recognized  letters  and  the 
questionable  letters  needs  to  be  used  to  consider 
probabilities  of  letter  combinations  and  context  to  derive 
a  "  best  guess  "  for  these  letter  combinations. 


Although  the  lest  four  letter  combinations,  mentioned 
above,  rarely  occur  in  the  English  language,  there  are  two 
more  likely  letter  combinations  which  could  also  not  be 
di st inqui shed  unless  postprocessing  is  also  used.  The  letter 
combinations  "rn"  and  "cl"  could  not  be  distinguished  from 
"m"  and  "d",  respectively,  unless  a  space  exist  between  the 
letters.  The  Figure  2.5  below  is  a  list  of  ambiguous  letter 

combinations  which  could  not  be  distinguished  without  a 

space  between  the  letters. 

r  n  -  m 

cl  -  d 

v  v  -  w 

vw  -  vvv  -  wv 

ww  -  wvv  -  vvw  -  vwv  -  wvv  -  vvvv 

Figure  2.5  Indistinguishable  Touching  Letter 

Combinat ions 

In  addition  to  0,A,U,8  and  VCL  regions,  the  width  and 
height  of  VCL  regions  were  also  used  as  segmentation  clues. 
Take  for  example,  the  letter  combination  "db".  If  a  gap  is 
not  detected  between  these  two  letters,  one  could  not 
determine  whether  it  was  a  "cb"  or  a  "db"  except  by  noticing 
that  the  width  of  the  VCL  region  in  the  middle  of  the  letter 
combination  is  too  wide  for  only  one  letter.  Using  the 

width  of  the  VCL  regions  as  a  segmentation  clue  takes 

advantage  of  the  fact  that  for  all  fonts  the  width  of 

vertical  lines  is  nearly  constant  for  all  letters  in  the 

font  (  1  ).  (  See  Appendix  B  for  examples  of  printed  fonts. 

However,  there  are  exceptions  in  handwritten  print  due  to 


font  (  1  ).  (  See  Appendix  B  for  examples  of  printed  fonts.  ) 

However,  there  are  exceptions  in  handwritten  print  due  to 
the  varying  positions  of  the  writing  instrument.  This 
algorithm  will  assume  a  near  constant  width  for  the  VCL 
regions  of  each  letter.  If  this  assumption  was  not  made. 
Figure  2.6  list  additional  letter  combinations  which  could 
not  be  distinguished,  by  a  human  reader,  unless  a  space 
existed  between  the  letters. 

In  -  h 

nn  -  m 

1  o  -  b 

ol  -  d 

Figure  2.6  Letter  Combinations  D i s t i nqu i shab 1 e  Using 

Width  of  VCL  Regions 

Besides  constant  width  of  VCL  regions,  height  of  VCL 

regions  were  also  used  as  a  segmentation  clues.  Take  for 
example  the  letter  combination  "rh".  This  letter 

combination  consist  of  two  A  regions  on  either  side  of  a  VCL 
region.  If  a  gap  is  not  detected,  the  same  features  would 
be  detected  in  "rn"  ,  "nn",  "rln",  or  "m".  By  noting  the 

greater  height  of  the  middle  VCL  region  and  the  nonexistence 
of  a  letter  that  looks  like  the  mirror  image  of  an  "h"  ,  the 

algorithm  can  not  only  segment  this  letter  combination  but 
also  determine  that  it  is  due  to  "rh"  or  "rln".  Using  width 
of  VCL  regions,  the  algorithm  can  further  determine  that  the 
letter  combinations  is  "rh"  and  not  "rln". 


2.3  NONEXISTENT  LETTER  FEATURES  AS  SEGMENTATION  CLUES 


advantage  of  the  nonexistence  of  features  in  a  single  letter 


of  the  English  alphabet.  The  following  list  of  nonexistent 
letter  features  were  considered  in  the  final  segsentation 
decision  algorithn.  The  long  vertical  lines  in  these 
letters  could  be  due  to  adjacent  letters  such  as 
b , h , d , k , 1 , g , p ,  or  q.  In  all  these  cases,  the  segmentation 
point  is  in  between  the  A,0,U,  or  8  regions  and  the  longer 
vert ical  1 ine  . 


Nonexistent  Features 
A-TALL  VCL 
TALL  VCL-A 
TALL  VCL-O 
8-TALL  VCL 

TALL  VCL-8 

U-TALL  VCL 
TALL  VCL-U 


Example  Cases 
nl  nj  rl  rj 

r  t 

lc  p 

al  aj  eJ  e|  si  s| 
z!  2| 

la  m  le  p  Is  is 

Iz  fZ 

J  uj  v|  vl 

lu  pi  Iv  [v 


Figure  2.7  Cases  Segmented  Using  Nonexistent  Features 


One  other  feature  that  does  not  exist  in  a  single 
letter  is  a  vertical  line  that  is  longer  than  the  length  of 
tall  letters  such  as  the  letter  "1".  Using  this 
information,  letters  are  segmented  where  a  VCL  region  exists 
that  is  too  tall.  These  cases  exist  when  a  tall  letter  and 
a  letter  which  has  a  vertical  line  below  the  line  of  print 
are  adjacent  to  each  other.  These  cases  are  listed  in 
Figure  2.8  below.  This  segmentation  clue  would  be  valid 


even  if  the  width  of  the  VCL  regions  was  not  wide  enough  for 
two  adjacent  letters.  In  all  these  cases,  the  segmentation 
point  is  the  middle  of  the  tall  VCL  region. 

Nonexistent  Letter  Feature  Example  Cases 


VCL  region  taller 

than  "1" 

gb 

gb 

gk 

gl 

qb 

qh 

qk 

q  1 

jb 

jh 

jk 

jl 

dp 

ip 

Figure  2.8 

Cases  Segmented 

U  s  i  ng 

Length 

o  f 

VCL 

2.4  DETECT  PROBLEM  LETTERS 

Two  individual  letter  detection  programs  were  also 
written  to  reduce  the  chances  of  error.  A  "t"  detector  was 
written  because  the  letter  "t"  is  the  second  most  frequent 
letter  in  the  English  language.  In  addition,  the  previous 
segmentation  clues  did  not  adequately  distinquish  between 
VCL  and  A  regions  in  a  "t"  to  those  found  in  "h","n" ,"m" , 
and  "r".  Because  the  " t ”  detector  searches  for  the  cross 
shape  in  a  "t",  it  also  detects  the  presence  of  an  "f". 
Also,  an  "i"  detector  was  written  because  the  letter  ”i" 
occurs  with  relatively  high  probability  and  the  VCL  in  an 
"i"  could  not  be  distinquished  from  the  VCL  in  other 
letters.  For  example,  a  "ci"  and  a  "ri"  combination  with  no 
space  in  between  the  letters  could  look  like  "a"  and  "n", 
respectively.  The  "i"  detector,  which  also  detects  the 
letter  "j",  is  used  to  segment  all  letter  combinations 


involving  "t"  or  "j". 


The  "t"  detector  is  used  to  segment 


all  letter  combinations  involving  " t "  or  "f". 


.5  FINAL  DECISION  RULES 

General  decision  rules  have  been  mentioned  above  along 
with  specific  examples.  In  deriving  the  final  decision 
rules  for  all  the  two  letter  combinations  consisting  of 
A,0,U ,  or  8  regions  on  either  side  of  a  VCL  region,  each 
case  had  to  be  considered  individually  to  derive  more 
specific  segmentation  decision  rules.  Additional  cases  are 
those  involving  a  A,0,U,  or  8  region  on  either  side  of  the 
letter  " 1 "  or  tall  VCL.  The  first  two  pages  list  more 
general  decision  rules  based  on  various  adjacent  regions  and 
width  of  VCL.  The  rest  of  the  decision  rules  are  more 
specific  and  consider  nonexistent  letter  features.  All 
these  cases  are  listed  on  the  following  pages.  The  letter 
combinations  involving  a  "i",  "j",  "f"  or  "t"  are  omitted 

because  these  cases  are  segmented  using  the  "i"  detector  and 
"t"  detector.  In  listing  the  different  cases,  two  distinct 
shapes  for  the  letters  a,  g,  and  y  were  recognized.  These 
different  shapes  are  shown  in  Figure  2.9  below. 

a  -  a 

8-9 
y  -  y 

Figure  2.9  Letters  Commonly  Printed  in  Two  Distinct 


Also,  the  following  abbreviations  were  used  in  explaining 
the  segmentation  clues  for  the  different  letter  combinations 
listed  on  the  following  pages: 

thick  -  VCL  region  too  wide  to  be  caused  by  one  single 
letter 

thin  -  VCL  region  just  wide  enough  for  one  letter 

PVC  -  (  previous  VCL  )  VCL  region  preceding  the  VCL  region 

be  considered  for  segmentation 

NVC  -  (  next  VCL  )  VCL  region  immediately  following  VCL 

region  being  consider  for  segmentation 

TS  -  position  which  marks  top  of  small  letters  such  as 
a , c , e  ,  or  o 

BS  -  position  which  marks  bottom  of  small  letters  such  as 
a , c , e ,  or  o 

TALL-VCL  -  VCL  region  whose  height  is  taller  than  the 
distance  from  TS  to  BS  such  as  the  letter  "l" 

G  -  8-region  which  extends  below  the  line  of  print  as  in  g 


A/U  - 

represents 

a  A  and 

U  -  r eg i on 

which 

over  lap 

each 

other 

U/ A  - 

represents 

a  U  and 

A-region 

wh  i  c  h 

overlap 

each 

other 

DECISION  RULES 


FEATURES 

CASES 

8- VCL-0 

a  b 

ac 

ad 

ap 

thick. 

9Q 

5b 

3C 

3d 

8- VCL-0 

e  q 

e  b 

e  c 

e  d 

thin 

8  O 

s  b 

s  c 

s  d 

go 

gb 

gc 

gd 

Z  Q 

z  b 

z  c 

z  d 

O-VCL-O 

QQ 

Qb 

ac 

ad 

thick 

bo 

bb 

b  c 

bd 

d  d 

db 

d  c 

dd 

oo 

o  b 

oc 

od 

po 

pb 

pc 

pd 

qo 

qb 

qc 

qd 

O-VCL-O 

C  Q 

c  b 

c  c 

c  d 

thin 

ka 

kb 

kc 

kd 

z  a 

z  b 

z  c 

zd 

U-VCL-0 

UQ 

u  b 

u  c 

ud 

thick 

3b 

yd 

U-VCL-0 

VQ 

v  b 

V  c 

vd 

thin 

wa 

wb 

wc 

wd 

ya 

yb 

yc 

yd 

A- VCL-0 

r  q 

r  b 

r  c 

r  d 

thin 

va 

v  b 

v  c 

vd 

wa 

w  b 

wc 

wd 

ya 

yb 

yc 

yd 

A- VCL-O 

t 

thin 

A- VCL-0 

ha 

hb 

he 

hd 

thick 

ma 

m  b 

me 

md 

SEGMENTATION  DECISION 


ao 

aq 

middle 

VCL 

9P 

s° 

ep 

eo 

eq 

before 

VCL 

sp 

s  o 

sq 

gP 

go 

gd 

zp 

zo 

zq 

ap 

ao 

aq 

mi dd 1 e 

VCL 

bp 

bo 

bd 

dp 

do 

dd 

op 

oo 

oq 

PP 

po 

Pd 

dd 

qo 

dd 

cq 

c  o 

Cd 

before 

VCL 

kq 

ko 

kq 

zq 

z  o 

zq 

uo 

up 

Ud 

middle 

VCL 

yo 

yP 

yd 

VO 

vp 

vq 

before 

VCL 

wo 

wp 

wq 

yo 

yp 

yd 

r  o 

rp 

r  q 

before  VCL 

VO 

vp 

vq 

wo 

w  p 

wq 

yo 

yp 

yd 

do  nothing 

ho  h  p 

hq 

middl e  VCL 

mo  mp 

mq 

8-VCL-A 

e  h 

em 

e  n 

e  r 

be  fore 

VC  L 

thin 

sh 

s  m 

s  n 

s  r 

zh 

zm 

z  n 

z  r 

8-VCL-A 

ah 

am 

an 

a  r 

middle 

VCL 

thick 

gh 

gm 

gn 

gr 

gh 

gm 

gn 

9r 

A-VCL-A 

rh 

r  m 

r  n 

r  r 

before 

VCL 

thin 

vh 

vm 

vn 

v  r 

wh 

wm 

wn 

wr 

yh 

y  m 

yn 

y* 

A-VCL-A 

hh 

hm 

hn 

hr 

middle 

VCL 

thick 

mh 

mm 

mn 

ror 

nh 

nm 

nn 

nr 

A-VCL-A 

m 

and 

some 

t  *  8 

do  nothing 

thin 


U-VCL-A 

uh 

urn 

un 

ur 

middle 

VCL 

thick 

y  m 

yn 

U-VCL-A 

vh 

vm 

vn 

vr 

before 

VCL 

thin 

wh 

wm 

wn 

wr 

The  Following  Cases  Need  Additional  Segmen t a t i on  Clues 
to  Distinguish  Letter  Combinations  Marked  * 


A-VCL-B 

he 

hg 

h9 

hz 

middle  VCL 

thick 

me 

mg 

mg 

mz 

ne 

ng 

n9 

nz 

A-VCL-8 

h  s 

ms 

n  s 

after  VCL 

thin 

ha 

ma 

na 

h  z 

mz 

nz 

A-VCL-8 

r  e 

fg 

r9 

before  VCL 

thin  * 


O-VCL-8 

oe 

°9 

og 

middle  VCL 

thick 

be 

b 9 

bg 

de 

dS 

dg 

oe 

°9 

og 

pe 

P9 

Pg 

qe 

«9 

qg 

O-VCL-8 

OS 

after  VCL 

thin 

ba 

b  s 

bz 

da 

ds 

bz 

oa 

OS 

b  z 

pa 

ps 

pz 

qa 

qs 

qz 

O-VCL-8 

c  e 

eg 

c9 

before  VCL 

thin  * 


8-VCL-8 

aa 

as 

a  z 

after 

VCL 

thin  * 

g* 

gs 

gz 

8-VCL-8 

ae 

a9 

a8 

midd le 

VCL 

thick 

9e 

8-VCL-8 

ee 

eg 

e9 

be  f  ore 

VCL 

thin 

se 

sg 

8e 

ze 

zg 

z9 

U-VCL-8 

ue 

Ug 

u9 

m i dd 1 e 

VCL 

thick 

9* 

yg 

See  pages  marked  "  Specific  Segmentation  Rules 


U-VCL-8 

u  a 

u  s 

U  Z 

after  VCL 

thin 

ys 

yz 

U-VCL-8 

ve 

Vg 

vy 

V  z 

before  VCL 

thin  * 

we 

Wg 

wq 

wz 

ye 

yg 

V3 

yz 

U-VCL-U 

thick 

uu 

uy 

3U 

middle  VCL 

U-VCL-U 

thin 

U  V 

uw 

uy 

after  VCL 

U-VCL-U 

V  u 

vu 

yu 

before  VCL 

thin  * 

vb 

wq 

O-VCL-U 

QU 

bu 

du 

ou 

pu 

qu 

middle  VCL 

thick 

bb 

dy 

°y 

Py 

<»y 

O-VCL-U 

QV 

b  v 

d  v 

O  V 

pv 

qv 

after  VCL 

thin 

QW 

b  w 

du 

o  w 

pw 

qw 

ay 

by 

dy 

oy 

py 

dy 

O-VCL-U 
thin  * 

c  u 

Cy 

before  VCL 

A - VCL- U 

hu 

mu 

nu 

middle  VCL 

thick 

ny 

ny 

A-VCL-U 
thin  * 

r  u 

rb 

bef  ore  VCL 

A-VCL-U 

h  v 

m  v 

n  v 

after  VCL 

thin 

hw 

mw 

nw 

hy 

my 

ny 

8-VCL-U 

au 

middle 

VCL 

thick 

ay 

8-VCL-U 

eu 

gu 

su  z  u 

before 

VCL 

thin 

3q  Zsj 

8-VCL-U 

a  v 

yv 

after 

VCL 

thin  * 

aw 

q  w 

ay 

<jy 

*  See 

pages  marked 

"  Specific 

Segmentation  Rules  " 

O-VCL-A 
thin  * 

ch 

cm 

c  n 

c  r 

before  VCL 

O-VCL-A 

av 

b  v 

o  v 

pv 

qv 

after  VCL 

thin 

QW 

bw 

o  w 

pw 

q  w 

ay 

by 

°y 

py 

qy 

O-VCL-A 

ah 

am 

an 

ar 

m  i  dd 1 e  VCL 

thick 

bh 

bm 

bn 

br 

dh 

dm 

dn 

dr 

SPECIFIC  SEGMENTATION  RULES 


ADDITIONAL  RULES  CASES  SEGMENTATION  DECISION 

A-VCL-8,  thin  VCL 

Height  PVC  ■  (  TS-BS  )  r  g  rg  before  VCL 

8  region  extends  below  BS 
No  A  region  before  thin  PVC 

Height  PVC  -  (  TS-BS  )  re  before  VCL 

Height  VCL  <  (  BS-TS  ) 

No  A  region  before  thin  PVC 

If  8  region  between  TS,BS,  no  thick.  NVC 

Space  2nd  edge  8  region 


O-VCL-8,  thin  VCL 

Height  PVC  <  (  BS-TS  )  ce  before  VCL 

VCL  or  PVC  not  <  BS  or  >  TS 

If  8  region  between  TS , BS ,  no  thick  NVC 

Space  2nd  edge  8  region 

Height  PVC  <  (  BS-TS  )  eg  eg  before  VCL 

8  region  extend  below  BS 
VCL  or  PVC  not  <  BS  or  >  TS 


8-VCL-8,  thin  VCL 

1st  8  region  extends  below  BS  ga  gs  after  VCL 

No  PVC  at  1st  edge  of  1st  8  region  aa  after  VCL 

No  thick  PVC 

NVC  2nd  edge  2nd  8  region 
No  2nd  8  region  below  BS 

No  PVC  at  1st  edge  of  1st  8  region  as  after  VCL 

No  thick  PVC 

Space  after  2nd  8  region 
Space  1st  edge  1st  8  region 
No  2nd  8  region  below  BS 
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U-VCL-8,  thin  VCL 


8 

r  eg  i  on 

extends  below  BS 

vg 

VC, 

bef  ore 

VCL 

No 

thick 

PVC 

wg 

WG 

A 

region 

bef  ore  VCL 

y& 

y$ 

No 

PVC  at 

1st  edge  U  region 

ve 

we  ye 

before 

VCL 

No 

thick 

PVC 

No 

thick 

NVC 

No  NVC  2nd  edge  8  region 
A  region  before  VCL 


U-VCL-U,  thin  VCL 

No  thick  PVC  vu  wu  yu  before  VCL 

No  VCL  1st  edge  1st  U  region  Vy  wy 

Thick  NVC 

NVC  2nd  edge  2nd  U  region 
A  region  before  VCL 


O-VCL-U,  thin  VCL 

No  PVC  or  VCL  >  TS  or  <  BS  cu  cy  before  VCL 

NVC  thick 

NVC  2nd  edge  U  region 
NVC  <  BS 


A - VCL- U  ,  thin  VCL 

PVC  -  <  BS-TS  )  ru  ry  before  VCL 

NVC  <  BS 
NVC  thick 

NVC  2nd  edge  U  region 


8-VCL-U,  thin  VCL 

No  thick  NVC  gv  gw  gy  after  VCL 

No  NVC  2nd  edge  U  region 
VCL  <  BS 

8  region  extends  below  BS 
PVC  1st  edge  8  region 
Thick  PVC 

No  thick  PVC  av  aw  ay  after  VCL 

No  8  region  extending 
be  1 ow  BS  . 

A  region  after  VCL 
Space  1st  edge  8  region 


TALL  VCL-A 
thick 

lh 

1m 

1  n 

lr 

a  idd  1  e 

VCL 

TALL  VCL-A 

thin 

1  V 

1  w 

iy 

after 

VCL 

TALL  VCL-A 
thin  * 

h 

do  nothing 

TALL  VCL-0 
thick 

la 

lb 

1  c 

Id 

1  o 

ip  iq 

middle 

VCL 

TALL  VCL-U 

thick 

lu 

m  i  dd 1 e 

VCL 

TALL  VCL-U 

thin 

lv 

1  w 

iy 

after 

VCL 

TALL  VCL-8 

thick 

le 

*9 

ig 

■idd 1 e 

VCL 

TALL  VCL-8 

thin 

1  a 

1  S 

1  z 

*6 

after 

VCL 

A-TALL 

thick 

VCL 

hi 

ml 

nl 

middle 

VCL 

A-TALL 

thin 

VC  L 

r  1 

vl 

vl 

y 1 

bef  ore 

VCL 

O-TALL 

thick 

VCL 

al 

bl 

dl 

o  1 

pi 

qi 

middle 

VCL 

O-TALL 

thin 

VCL 

c  1 

before 

VCL 

U-TALL 

thick 

VCL 

ul 

y1 

middle 

VCL 

U-TALL 

thin 

VCL 

vl 

vl 

y  i 

bef  ore 

VCL 

8-TALL 

thick 

VCL 

a  1 

91 

middle 

VCL 

8-TALL 

VCL 

el 

Cl 

8  1 

z  1 

bef  ore 

VCL 

ENHANCED 

DECISION 

RULES 

FEATURES 

CASES 

SEGMENTATION  DECISION 

8- VCL-0 

a  b 

ac 

ad 

ap 

a  o 

aq 

middle  VCL 

thick. 

8-VCL-O 

ea 

e  b 

ec 

ed 

ep 

eo 

eq 

bef  ore  VCL 

thin 

s  a 

s  b 

s  c 

s  d 

sp 

s  o 

sq 

ZQ 

z  b 

z  c 

z  d 

zp 

zo 

zq 

O- VCL-0 

aa 

ab 

ac 

ad 

QP 

QO 

aq 

middle  VCL 

thick 

ba 

bb 

b  c 

bd 

bp 

bo 

bq 

d  a 

db 

d  c 

dd 

dp 

do 

dq 

oa 

ob 

o  c 

od 

op 

oo 

oq 

pa 

pb 

pc 

pd 

PP 

po 

pq 

Q  a 

qb 

qc 

qd 

qq 

qo 

qq 

O-VCL-O 

c  a 

c  b 

c  c 

cd 

cq 

CO 

cq 

be  fore  VCL 

thin 

k  a 

kb 

kc 

kd 

icq 

ko 

Wq 

za 

z  b 

zc 

z  d 

zq 

zo 

zq 

U- VCL-0 

ua 

ub 

uc 

ud 

U  O 

up 

uq 

middle  VCL 

thick 

yb 

^c 

Vjd 

cc 

0 

yp 

yq 

U/A-VCL-O 

VQ 

vb 

VC 

vd 

VO 

vp 

vq 

before  VCL 

thin 

WQ 

ub 

wc 

wd 

WO 

wp 

wq 

ya 

yb 

yc 

yd 

yo 

yp 

yq 

A-VCL-0 

r  q 

rb 

r  c 

r  d 

r  o 

rp 

r  q 

bef  ore  VCL 

thin 

A- VCL-O 

t 

do  nothing 

thin 

A-VCL-0 

ha 

hb 

he 

hd 

ho 

hp 

hq 

mi dd 1 e  VCL 

8-VCL-A 

Chin 


before  VCL 


8-VCL-A 

eh 

em 

en 

e  r 

bef  ore 

VCL 

thin 

sh 

sm 

a  n 

s  r 

zh 

zm 

zn 

z  r 

8-VCL-A 

thick 

ah 

am 

an 

ar 

middle 

VCL 

A-VCL-A 

r  h 

rm 

r  n 

r  r 

bef  ore 

VCL 

thin 

A-VCL-A 

hh 

hm 

hn 

h  r 

mi dd 1 e 

VCL 

thick 

mh 

mm 

mn 

mr 

nh 

nm 

nn 

nr 

A-VCL-A 

m 

and 

some 

t  '  s 

do  nothing 

thin 


U-VCL-A 

uh 

um 

un 

ur 

middle 

VCL 

thick 

yh 

y  m 

yn 

3r 

U/A-VCL-A 

vh 

vm 

vn 

v  r 

bef  ore 

VCL 

thin 

wh 

wm 

wn 

wr 

yh 

y  m 

yn 

yf 

A-VCL-G 

hg 

™8 

ng 

middle 

VCL 

thick 

hg 

mg 

n9 

A-VCL-G 

r8 

r9 

O-VCL-G 

°g 

*>g 

dg 

og  pg  qg 

bef  ore 

VCL 

thick 

og 

bg 

dg 
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iii.  !9EiyARE  development 

The  following  chapter  explains  the  software  which 
implements  the  segmentation  algorithm  described  in  Chapter 
2.  This  chapter  is  divided  into  three  sections.  The  first 
section  describes  the  software,  written  by  AFIT  faculty,  to 
store  a  digitized  image  into  a  video  file  and  to  manipulate 
the  individual  pixels  in  that  file.  The  second  section 
describes  MAIN.FR,  the  main  calling  program  which  calls  all 
the  subroutines  that  proc>-sses  the  digitized  image  to 
produce  segmentation  clues  described  in  Chapter  2.  The 
final  section  describes  the  program,  DECALG.FR,  which  uses 
all  the  segmentation  clues  to  decide  the  segmentation  points 
of  individual  letters  in  a  word. 

3.1  DIGITIZATION  SOFTWARE 

The  Octek  program  and  the  PICBUF  routines,  written  by 
Captain  King,  were  used  extensively  as  the  main  tool  for 
capturing  and  processing  a  digitized  image  of  a  word  (  8  ). 

The  Octek  program  is  a  user  friendly  program  which 
interfaces  with  the  hardware  of  the  video  camera  and  the 
Octek  2000  Image  Analyzer  board.  The  Octek  program  enables 
the  user  to  threshold  a  digital  image  ,  to  interactively 
position  a  cursor  to  find  coordinates  of  significant  points 
on  the  monitor  screen,  and  store  the  image  in  a  video  file. 

PICBUF  is  a  library  of  Fortran  callable  subroutines 
which  hide  the  system  dependent  details  of  manipulating 
pixels  in  a  digital  image.  The  PICBUF  subroutines  were  used 


to  read  the  value  of  individual  pixels  or  an  entire  row  of 
pixels  from  a  video  file.  The  PICBUF  routines  were  also 
used  to  write  to  a  video  file  in  order  to  display,  in  the 
form  of  a  digital  image,  the  results  of  a  software  program. 

3.2  SEGMENTATION  CLUES  SOFTWARE 

The  following  section  discusses  the  software  which 
determines  the  segmentation  clues  explained  in  Chapter  2. 
Assume  that  the  thresholded  (  binarized  )  digital  image  of  a 
word  has  already  been  stored  in  a  video  file  and  that  the 
coordinates  of  the  word  have  been  recorded  using  the  OCTEK 
program . 

The  program,  MAIN.FR,  prompts  the  user  for  the  name  of 
the  video  file  and  the  coordinates  of  a  word  to  be 
segmented.  The  program  then  stores  the  digital  image  of  the 
word  in  a  buffer  using  the  PICBUF  routines.  Next,  MAIN 
calls  nine  subroutines  and  several  PICBUF  routines  which 
implement  the  segmentation  algorithm  and  output  the  results 
of  each  subroutine  into  a  video  file  for  display.  The 
following  paragraphs  explain  each  of  the  segmentation 
subroutines  called  by  MAIN. 

VRTPROJ  (  vertical  projection  )  counts  the  number  of 
pixels  in  each  column  of  the  digital  word  and  puts  the 
result  in  an  output  buffer  for  display.  This  routine  also 
locates  columns  which  do  not  contain  black  pixels.  These 
columns  indicate  spaces  or  gaps  between  letters  in  a  word. 
Three  types  of  gaps  are  located.  These  "GAP's"  are  weighted 


arbitrarily  for  display  purposes  and  also  to  indicate 


confidence  in  the  location  of  letter  edges. 


1.  1  pixel  in  column  at  local  minimum  (  GAP  of  weight  S  ) 

2.  No  pixels  in  column  (  GAP  of  weight  10  ) 

3.  No  pixels  in  two  adjacent  columns  (  GAP  of  weight  IS  ) 

GAP's  of  weight  10  or  IS  are  used  to  locate 
segmentation  points  and  also  to  identify  adjacent  letters  to 
determine  neighboring  segmentation  points.  GAP's  of  weight 
S  have  not  been  incorporated  in  the  segmentation  decision 
algorithm.  Figure  3.1  below  indicates  two  types  of  gaps 
found  in  the  word  "  mat  ". 


□r  i  gi  rial 

Digitized 

Word 


Pixels  /  Column 


pixels  In  each  row  of  the  digital  word  and  puts  the  result 
In  an  output  buffer  for  display.  The  array  HPROJ  which 
contains  the  number  of  pixels  per  row  is  passed  to  FDLHT. 

FDLHT  (  find  letter  height  >  finds  the  heights  of 
various  letters  by  taking  advantage  of  the  greater 
concentrations  of  black  pixels  per  row  that  occur  at  the 
location  of  small  letters  such  as  a.o.u.e,  etc.  The  various 
letter  heights  are  explained  below. 

TT  -  top  of  tall  letters  such  as  b.d.f.h.k.l  or  t. 

TS  -  top  of  small  letters  such  as  c>e,g>pt  or  o. 

BS  -  bottom  of  small  letters  such  as  a,c,et  or  o. 

VBS  -  very  bottom  of  small  letters  such  as  g>J,pl  or  q. 

The  letter  height  values  are  used  in  the  final  decision  rule 
to  determine  whether  a  detected  VCL  is  due  to  a  specific 
letter.  See  Figure  3.2  below. 


Figure  3 . 2 


Display  Showing  Projection  of  Pixels 
And  Three  Detected  Letter  Heights 


DVCLIN  (  detect  continuous  vertical  lines  )  detects 
the  longest  vertical  black  line,  which  does  not  have  a  space 
of  two  or  more  pixels,  „for  each  column  in  the  digital  image 
of  a  word.  This  subroutine  detects  the  VCL  regions 
mentioned  in  Chapter  2. 

EDVCL  (  edge  of  vertical  continuous  lines  )  determines 
the  edges  of  significant  VCL  regions  found  in  DVCLIN. 
Significant  VCL  regions  are  defined  as  those  which  have  a 
height  that  is  greater  than  or  equal  to  1/2  the  distance 
from  TS  to  BS.  See  Figure  3.3  below. 
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greater  than  the  distance  from  TS  to  BS  and  which  have  a 
continuous  horizontal  line  passing  through  the  VCL  region  at 
a  height  near  TS.  FINDT  also  determines  the  edges  of  two 
adjacent  "t/f"  's  or  the  edges  of  a  single  "t/f"  overlapping 
other  letters. 

IFIND  (  "i"  find  )  finds  the  location  of  letters  "i"  or 
"j"  by  looking  for  a  VCL  region  greater  than  or  equal  to  the 
distance  from  TS  to  BS  and  which  has  a  concentration  of 
pixels  above  the  the  VCL  region. 

DTOUAE  (  detect  “o” , "u" , arches ,  and  "e"  )  detects  the 
presence  of  OtU>A>  and  8  regions  defined  in  the  Theoretical 
Development.  This  subroutine  calls  four  additional 
subroutines.  The  first  subroutine,  DELOUA  (  delete  ”  o  ”  ,  "  u  ”  , 
and  arches  ),  deletes  all  0,U,  and  A-reglons  which  are  too 
short  or  too  narrow  to  be  caused  by  a  legitimate  letter. 
The  second  subroutine  called  is  DTEG  (  detect  "e"  or  "g"  ). 
DTEG  looks  at  all  regions  which  meet  the  conditions  for  0 - 
regions  and  also  meet  the  conditions  for  8  regions.  DTEG 
then  looks  at  each  8-region  and  0  -regions  and  assigns  it  to 
one  of  six  categories  based  on  its  vertical  position 
relative  to  the  various  letter  heights.  These  four  types  of 
O-regions  and  two  types  of  8-regions  can  be  used  to  help 
recognize  the  letters  in  order  to  better  determine  the 
segmentation  location.  Similiarly,  the  last  two  subroutines 
called  by  DTOUAE,  DTA  and  DTU,  find  two  different  types  of  A 


and  U-  regions 


respectively 


The  edges  of  these  regions 


are  put  in  an  output  buffer  for  display 


The  various  types 


of  O ,  8 , 


U,  and  A  regions  and  their  assigned  numbers  for 


display  are  given  below. 
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HTVCL  (  height  of  VCL  )  divides  the  VCL  regions  into 
two  halves.  This  subroutine  then  finds  the  maximum  position 
and  mininum  position  of  the  vertical  lines  contained  In  the 
two  halves  of  the  VCL  regions.  The  VCL  regions  are  divided 
in  half  to  account  for  cases  where  the  touching  vertical 
lines  of  two  adjacent  letters  have  two  different  lengths  or 
positions  as  in  the  case  of  "gn"  or  "gh”,  respectively. 
These  heights  and  positions  of  the  vertical  lines  in  the 
digitized  image  of  the  word  are  used  as  segmentation  clues 
in  the  decision  algorithm. 

3.3  SEGMENTATION  DECISION  SOFTWARE 

The  last  subroutine  MAIN  calls  is  DECALG  which  uses  all 


the  segmentation  clues  to  decide  where  the  letters  in  the 


word  should  be  segmented.  DECALG  gather  all  segmentation 
clues  for  each  VCL  into  a  one  two  dimension  array  IVCL.  The 
height  of  each  VCL  and  the  presence  of  gap«  are  already 
stored  in  IVCL  when  HTVCL  and  VRTPROG  are  called, 
respectively.  Another  column  in  IVCL  is  used  to  indicate 
whether  the  VCL  is  either  part  of  a  "t"  ,  near  a  "t"  ,  part 
of  a  "i" ,  part  of  a  "v",  or  part  of  a  "x".  The  DECALG  then 
considers  each  VCL  in  the  word  individually  while 
determining  what  regions  are  on  both  sides  of  the  VCL.  The 
program  then  makes  a  segmentation  decision  based  on  the 
decision  rules  outlined  in  Chapter  2.  The  segmentation 
location  is  either  at  the  middle,  the  first  edge,  or  the 
second  edge  of  the  VCL  region.  For  those  cases  where  the 
segmentation  clues  do  not  yield  enough  information  to  decide 
whether  to  segment  before  or  after  the  VCL  region,  the 
individual  letter  recognizer  would  have  to  consider  both 
segmentation  locations  and  determine  which  one  leaves 
legitimate  letters  on  both  sides  of  the  segmentation  point. 
Vertical  lines  are  used  to  indicate  below  the  word  the 
segmentation  points  decided  by  DECALG. 


IV.  VALIDATION 

In  testing  the  validity  of  the  gegsentation  decision 
algorithm,  handwritten  letters  were  used  to  simulate  noise 
conditions.  Time  did  not  allow  the  testing  of  other  types 
of  print  because  more  time  was  needed  to  perfect  the 
decision  rules  and  investigate  more  segmentation  clues.  In 
all  four  of  the  examples  shown,  the  letters  were 
deliberately  written  so  that,  in  the  digitized  image  of  the 
word,  each  letter  was  not  separated  by  a  column  which 
contained  just  white  pixels.  Also,  no  care  was  taken  in 
writing  the  letters  with  perfectly  vertical  or  unbroken 
lines.  Looking  at  the  word  "the",  the  vertical  lines  in  the 
letters  "t"  and  "h"  are  slanted  slightly  to  the  right. 
Looking  at  the  word  "had",  the  vertical  line  in  the  letter 
"d"  has  two  breaks  in  it. 

The  following  four  cases  are  presented  to  demonstrate 
the  capabilities  of  the  segmentation  algorithm  as  discussed 
in  Chapter  2.  Except  for  the  word  "the",  the  four  words  were 
not  chosen  for  any  particular  reason.  The  word  "the"  was 
chosen  because  it  represents  the  most  common  word  in  the 
English  language  (  3  ).  Vertical  lines  are  used  to  mark  the 
segmentation  points  below  the  digitized  image  of  the  word. 
The  height  of  the  vertical  line  indicates  the  relative 
confidence  of  the  segmentation  decision.  If  two  clues 
indicate  a  particular  segmentation  point  a  height  of  10 
pixels  is  given  to  the  vertical  line.  The  location  of  "i" 
or  " t "  is  also  given  a  10  pixels  segmentation  mark.  If  only 


40 


one  clue  indicates  a  particular  segmentation  point  a  height 
of  5  pixels  is  given  to  the  vertical  line.  In  all  cases  the 
correct  segmentation  point  was  chosen  based  on  the 
segmentation  decision  software.  However,  more  rules  need  to 
added  to  the  software.  In  the  word  "  zebra  "  a  U-region  was 
found  to  exist  at  the  1st  edge  of  the  VCL  in  "e"  which  led 
to  an  incorrect  segmentation  location.  This  is  easily 
corrected  though  if  a  U-region  is  required  to  have  VCL's  on 
both  sides  of  the  region.  Alternatively,  a  U-region  which 
does  not  have  VCL's  on  both  sides  which  are  between  TS  and 
BS  could  be  used  as  a  segmentation  clue  since  it  represents 
an  nonexistent  letter  feature. 

Figures  4.1,  4.2,  4.3  and  4.4  show  the  segmentation 
points,  the  edges  of  the  VCL  regions,  and  the  edges  of  the 
A , 0  ,  U , 8  regions  for  the  four  cases.  In  the  word  "hod",  a 
higher  confidence  was  given  to  the  first  segmentation  point 
because  the  vertical  line  in  the  "h"  was  determined  to  be 
greater  than  TS  indicating  that  the  A  region  was  not  due  to 
an  "r"  or  an  "n"  and  that  the  VCL  region  between  the  "h"  and 
the  "o"  should  be  segmented  at  the  middle.  If  the  tall 
vertical  line  of  the  "h"  had  not  been  detected  the  algorithm 
could  not  determine  with  confidence  whether  the  A  region  was 
due  to  an  "r"  and  segment  the  letter  combination  at  the 
first  edge  of  the  VCL  region  or  whether  the  A  region  was  due 
to  an  "n"  and  segment  at  the  middle.  In  the  case  of  the 
words  "mat"  and  "the",  high  confidence  segmentation  marks 


were  indicated  for  the  edges  of  the  two  *' t  '  s  detected  by 
FINDT.  Further  validity  testing  needs  to  be  conducted  using 
other  letter  combinations  considered  in  the  theoretical 
development  of  the  segmentation  algorithm. 
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li.  conclusion  and  recommendations 

The  results  in  Chapter  4  indicate  that  it  is  possible 
to  segment  touching  English  letters  based  on  the  length  and 
position  of  vertical  lines,  and  VCL,  A,  0,  U,  and  8  regions. 
As  mentioned  in  the  introduction,  it  almost  necessary  to 
recognize  the  individual  letters  in  order  to  segment  letters 
which  are  not  separated  by  a  space.  In  effect,  this  thesis 
work  has  shown  this  to  be  true  because  the  same  features 
which  were  used  to  segment  the  letters  could  be  used  to  help 
recognize  the  individual  letters.  This  algorithm  could  be 
simplified  and  made  more  reliable  by  considering 
probabilities  and  locations  of  letter  combinations.  Many  of 
the  letter  combinations  considered  in  Chapter  2  may  never 
occur  in  the  English  language.  Also  knowing  where  letter 
combinations  can  occur  in  a  word  could  also  improve  the 
reliability.  For  example,  by  knowing  that  a  "rn" 

combination  can  not  begin  a  word,  the  segmentation  algorithm 
could  avoid  falsely  segmenting  a  "m"  at  the  beginning  of  a 
word.  One  could  also  improve  reliability  by  avoiding 
segmenting  letter  combinations,  which  proved  difficult  to 
segment,  by  considering  these  letter  combinations  as  a 
single  "new”  letter.  The  "new"  letter  could  then  be 
recognized  in  the  same  manner  as  the  other  single  letters  in 
the  alphabet. 

In  testing  this  algorithm,  no  spacing  between  any  of 
the  letters  were  assumed  in  order  to  create  a  worst  case 


condition.  In  reality,  spaces  between  some  letters  in  a  word 
do  occur.  These  spaces  and  the  detected  letter  features 
could  be  used  to  recognize  a  letter  which  is  touching  an 
adjacent  letter.  By  knowing  one  of  the  letters  in  a 
touching  letter  combination,  the  segmentation  algorithm 
could  more  reliably  segment  the  letter  combination. 

The  work  done  in  this  thesis  is  just  a  starting  point. 
Similiar  software  needs  to  be  written  to  account  for  upper 
case  print.  Preprocessing  software  needs  to  be  written  to 
ignore  "  tails  "  at  the  edge  of  printed  fonts.  A  pattern 
recognition  approach  has  to  be  chosen  to  recognize  the 
individual  letters.  Already,  extensive  work  has  been  done  at 
AFIT  recognizing  individual  letters  using  two  dimensional 
Fast  Fourier  Transforms.  Also,  extensive  work  has  been  done 
by  IBM  in  the  area  of  postprocessing  to  correct  spelling 
mistakes  or  to  predict  a  garbled  letter  in  the  context  of  a 
word  (  2  ).  The  foundations  are  already  present  for  AFIT  to 


someday  demonstrate  a  reliable  reading  machine. 
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£************************************************************ 
C  SUBROUTINE  NAME:  MAIN  PROGRAM  (  MAIN.FR  ) 

C  WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

C  PURPOSE:  THIS  IS  THE  MAIN  CALLING  PROGRAM  WHICH  ASK  THE 
C  USER  FOR  THE  FILE  NAME  AND  COORDINATES  OF  THE  WORD  TO  BE 
C  SEGMENTED.  THIS  PROGRAM  THEN  CALLS  ALL  THE  SUBROUTINES 
C  AND  DISPLAYS  THE  BINARY  WORD  AND  A  DISPLAY  OF  ALL 
C  SEGMENTATION  CLUES  DETERMINE  BY  THE  SUBROUTINES. 


INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 


KTTYIN-1 1 
INAME( 20) , 


STROW,  STCOL,  TT,TS,BS,VBS 


PARAMETER  NBUFSZ-300 
PARAMETER  NUMCOLS-128 
PARAMETER  NUMROWS-128 
PARAMETER  KTTYOUT-IO,  KTTYIN-1 1 
INTEGER  IBUF(NBUFSZ) ,  INAME(20),  NPIX 
INTEGER  IBUF1 (NBUFSZ) 

INTEGER  IBUF3(NBUFSZ) 

INTEGER  I  BUF5 ( NBUFSZ ) 

INTEGER  ICOL,  IROW,  I,  STROW,  STCOL,  TT,TS,BS,VBS 
INTEGER  NCOLS,  NROWS 
INTEGER  V  PRO J (12  8),  GAP(128) 

INTEGER  I  ARRAY (  256) 

INTEGER  I  ARRAY  2(  256) 

INTEGER  HPROJC128),  RESULT(128),  VCL(128) 

INTEGER  MX VHT ( 128) ,  MXVHTX(128) 

INTEGER  MI N VHT (128),  MINVHTX(128) 

INTEGER  VPW,  CLW,  MMW ,  AVCG(128),  AVCW,  MVC 

INTEGER  EVC( 128) ,  ROW 

INTEGER  VCLD( 128)  ,  VCLU( 128)  ,  IFND(  128) 

INTEGER  MARG,  BGT<128),  EDT(128) 

INTEGER  BGVC( 128) ,EDVC( 128) ,BGI( 128) ,EDI( 128) 

INTEGER  BGO ( 128),ED0(128),BGU(128),EDU(128),EDE(128) 
INTEGER  BGA ( 128), EDA (128), BGT ( 128),EDT(128),BGE(128) 
INTEGER  S  EG (12  8) 

INTEGER  MXHT1 (128), MNHT1 (128),MXHT2(128> , MNHT2 (128) 
INTEGER  IOR(20,3)  ,  I  UR ( 2  0 , 3)  , IER(20,3)  , IAR(20,3) 
INTEGER  XIOR(20,3),XIUR(20,3),XIER(20,3),XIAR(20,3) 
INTEGER  TBD(20,2)  ,  IVCL(20, 10)  ,  IBD(20,2) 


EXTERNAL  VP 
EXTERNAL  DV 
EXTERNAL  HP 
EXTERNAL  FH 
EXTERNAL  EV 
EXTERNAL  HT 
EXTERNAL  DT 
EXTERNAL  IF 
EXTERNAL  TF 
EXTERNAL  DA 

CALL  OVOPN ( A , "MAIN . OL" , IER) ;  OVERLAY  USED  FOR  MORE 

:  MEMORY 


o  o 


10 


i  r  -: 

I  L 


WRITE  ( KTTYOUT ,10) 

FORMAT  (IX, "INPUT  PICTURE  FILE  NAME-",Z) 
READ  (KTTYIN.20)  I NAME ( l ) 

FORMAT  ( S  3  9 ) 

CALL  PICFMT  ( I B UF , I N AME ,  I HDR  ) 

CALL  GFMT  ( I BUF , NRO WS , N COL S , NB I TS , I MODE ) 
CALL  MAKB  (IBUF) 

CALL  PICIN  ( IBUF , INAME , IHDR) 


ACCEPT  "NUMBER  OF  ROWS*", NROWS 
ACCEPT  "NUMBER  OF  COLUMN S -", NCO LS 
ACCEPT  "STARTING  ROW-",  STROW 
ACCEPT  "STARTING  COL.-",  STCOL 

STORE  WORD  IN  INITIALIZED  BUFFER  I BUF  5 

CALL  PFMT  ( I BUF 1 , NROWS , NCOLS , A , IMODE ) 

CALL  MAKB  (IBUF1) 

DO  26  I-  l.NROWS 

CALL  GROW( IBUF, ( STROW- 1  )  +  I  , STCOL, NCOLS , I ARRAY) 

CALL  PROW ( I BUF  1  , I  ,  1  , NCO  LS , IARRAY) 

CONTINUE 

CALL  RELB ( I BUF ) 

CALL  PFMT  (IBUF  , NROWS , NCOLS , A , I MODE > 

CALL  MAKB (IBUF  ) 

MAKE  OUTPUT  BUFFER 

CALL  PFMT  (  IBUF3 , 2  56 , 256  ,  A  ,  IMODE ) 

CALL  MAKB  (IBUF3) 

BLANK  OUT  THE  ENTIRE  BUFFER 

DO  65  1-1,256 

I A  RR A  Y ( I ) »  1  5 

CONTINUE 
DO  70  1-1,256 

CALL  PROW( IBUF3 , I  ,  1  ,  2  56  ,  IARRAY) 

CONTINUE 

MARG-5 

DO  80  I -  1  , NRO WS 

CALL  GROW( IBUF1 , I , 1 , NCOLS , IARRAY)  ;  ORIGINAL  WORD 

CALL  PROW( IBUF3 ,  I  , MARG , NCOLS ,  I  ARRAY )  ;  TOP  LEFT 

CONTINUE 

CALL  PFMT( I BUF 5 .NROWS , NCOLS, A, I MODE) 

CALL  MAKB( IBUF5 ) 

CALL  OVLOD ( A , VP  ,1,IER) 

CALL  VRTPROJ( IBUF l , NROWS , NCOLS , VPROJ , GAP , IBUF  ,IBUF5) 


-•'I--' 


f 


TYPE  "VRTPROJ  DONE" 


DO  90  I* 1 , NROWS 

CALL  GROW( IBUF  , I ,  1  , NCO'  S , i ARRAY) 

CALL  PROW ( IBUF3 , 4*(NROWS )  +  I ,MARG*2+1 20 , NCOLS 
CONTINUE 


PAINTS 
VRTPROJ 
0  5  RT. 

I  ARRAY  ) 


;  PAINTS 

DO  92  1*1, NROWS  ;  GAP 

CALL  GROW( I BUF5 , I  ,  1  ,  NCOLS  ,  I  ARRAY  )  ;  05  LFT . 

CALL  PROW( I BUF3 , 4* ( NROWS  )  ■*■  I  ,MARG, NCOLS,  IARRAY) 
CONTINUE 
TYPE  "92  DONE" 


CALL  OVLOD( 4 , DV  ,1,IER) 

CALL  DVCLIN< IBUF 1 , NROWS , NCOLS , VCLD , VCLU , VCL , IBUF  ) 
TYPE  "DVCLIN  DONE" 

; PAINTS 

DO  95  1=1, NROWS  ;DVCLIN2 

CALL  GROW (IBUF  , I  ,  1  , NCOLS  ,  I ARR AY )  ;03  RT . 

CALL  PROW( IBUF3 , 2*(NROWS)+I ,MARG*2+120, NCOLS, 
IARRAY ) 

CONTINUE 


CALL  O  V  LOD ( 4 , HP  ,1,IER) 

CALL  HRZPROJ( IBUF1 .NROWS, NCOLS, HPROJ, IBUF5) 

TYPE  "HRZPROJ  DONE" 

DO  100  1=1, NROWS 

CALL  GROW(IBUF5,I, l , NCOLS .IARRAY ) 

CALL  PROW( IBUF3 , 3* (NROWS  )+I,MARG,NCO 
CONTINUE 

CALL  OVLOD ( 4 , FH  ,1,IER) 

CALL  FDLHTC  NROWS , HPROJ , TT , TS , BS , VBS  ) 

TYPE  "TOP  TALL  «",TT 
TYPE  "TOP  SHORT  *",TS 
TYPE  "BOTTOM  SHORT  =",BS 
TYPE  "VERY  BOTTOM  SHORT  *",VBS 

CALL  PPNT( IBUF3 , TT , MARG- 2 , 0  )  ;  MARKS  PTS  .  WHERE  TT , TS , 
CALL  PPNT( IBUF3 ,TS ,MARG-2 ,0) ;  ETC.  ARE  LOCATED 
CALL  PPNT( IBUF3 , BS , MARG-2 , 0 ) 

CALL  PPNT( I BUF3 , VBS , MARG-2 , 0 ) 

CALL  SGA VC (IBUF  , TS , BS , NCOLS , NROWS , A VCG , VCL ) 

TYPE  " SGA VC  DONE" 

CALL  DDCR V ( I BUF 1 , TS.BS, NCOLS, NROWS, MXVHT, MX VHTX, 

I BUF5 .IBUF  ) 

TYPE  "DDCRV  DONE" 

DO  105  1*1, NROWS 

CALL  GROW(  IBUF5 , I  ,  1  .NCOLS,  IARRAY) 


;  PAINTS 
;  HRZHIST 
j  04  LFT 
LS , IARRAY) 


PA  I  NTS 
DDCRV 
0  l  RT 


CALL  PROW ( IBUF3, I, MARG *2+120, NCOLS , I ARRAY ) 


CONTINUE 


DO  106  I -  1  , NROWS 

CALL  GROW ( I BUF  , I  ,  1  , NCOLS ,  I ARR A Y  ) 
CALL  PROW( IBUF3  ,  I  ,MARG*2+120, NCOLS 

CONTINUE 


; PAINTS 
;P.  DDCRV 

;  //  1  RT  . 

I  ARRAY  ) 


CALL  DUCRV( IBUF1 , TS.BS, NCOLS, NR OWS.MINVHT, MINVHTX , 
IBUF5 , IBUF  ) 

TYPE  "DUCRV  DONE" 


; PAINTS 

DO  108  1*1, NROWS  ; DUCR  V 

CALL  GROW( IBUF5 ,  I  ,  1  .NCOLS , IARRAY)  ;02  RT . 
CALL  PROW( I BUF3 , 2  * (NROWS  )  + 1 , MARG * 2+ 1 2 0 , 
NCOLS  ,  IARRAY ) 

CONTINUE 


; PAINTS 

DO  107  1*1, NROWS  ;P. DUCRV 

CALL  GROW (IBUF  , I , 1 , NCOLS , I ARRA Y )  ;#2  RT 

CALL  PROW( I BUF3 , ( NROWS  )  +  I  , MARG *  2+ 1 2 0 , 

NCOLS , IARRAY) 

CONTINUE 


CALL  OVLOD ( A , EV  ,1,IER) 

CALL  ED VCL ( I BUF  , NROWS , NCOLS , ROW , E VC , TS , BS , BGVC , ED VC , 

I VCL , IBUF5 ) 

; PAINTS 

DO  160  1=1, NROWS  ; EDVCL 

CALL  GROW(  IBUF5  ,  I  ,  1  ,  NCOLS  ,  IARRAY)  ;  </ 3  LFT  . 

CALL  PROW( IBUF3,2*( NROWS  ) + I , MARG , NCOLS , I ARRA Y ) 
CONTINUE 

DO  155  1=1, NCOLS 

IF  ( EVC( I ) . NE  .  10 )  GOTO  155 

CALL  PPNT( IBUF3 , 3*(NROWS  ) -  1  , MARG *  2+ 1 2 0 -  1  + 1 , 0 ) 
CONTINUE 


PUTS  A  PIXEL  BY  ROW  OF  DVCLIN  SELECTED  IN  EDVCL 
CALL  PPNT( IBUF3 , 2*(NROWS  ) +ROW , MARG *  2  +  1 2 0 , 0 ) 

CALL  OVLOD ( U , HT  ,1,IER) 

CALL  HT VC  L ( IBUFl.BGVC.EDVC, NROWS , NCOLS , MXHT  1  ,MXHT2 , 
MNHTI , MNHT2 , BS , TS , I VCL) 

CALL  OVLOD ( A ,  DT  ,1,IER) 

CALL  DTOUAE ( IBUFl, NROWS, NCOLS, TS,BS,IBUF3,IBUF2, 
XIOR,XIUR,XIAR,XIER,IOR,IUR,IAR,IER)  ; 

TYPE  "DTOUAE  DONE" 

;  PAINTS 

DO  120  1=1, NROWS  ;  DTOUAE 

CALL  GROW( IBUF5  ,  I  ,  1  , NCOLS , I  ARRAY  )  ;  0  A  RT . 

CALL  GROW (IBUF3,3*(NROWS)+I,MARG*2+120, NCOLS, 
IARRAY  ) 


120 

C 


1  30 
C 


1  40 


/ 


CONTINUE 


;  PAINTS 

DO  122  1*1, NROWS  ;  P.  DTOUA 

CALL  GROW(  IBUF2  ,  I  ,  1  ,NCOLS  ,  IARRAY)  ;  It  4  LFT  . 
CALL  PROW( IBUF3 , 3*(NROWS  ) +1 , MARG , NCO LS , 
IARRAY  ) 

CONTINUE 


CALL  O VLOD ( 4 ,  IF  ,1,IER) 

CALL  I F I ND ( NCOLS , BG VC , ED VC , VCLD , VCLU , TS , BS , I FND , BG I  , 
/  EDI.IBD) 

TYPE  "IFIND  DONE" 

DO  130  I -  1  , NCOLS 

IF  (IFND(I).NE. 10)  COTO  130 

CALL  PPNT( IBUF3 .NROWS+l , MARG-l+I ,0) 

CONTINUE 

CALL  OVLOD (  4  ,  TF  ,1,IER) 

CALL  FINDT(IBUF1 , NCO LS , NRO WS , E VC , TS , B S , MXHT l ,MXHT2, 

/  MNHT1 ,MNHT2 , BGT,EDT,TBD) 

TYPE  "FINDT  DONE" 

DO  140  1 -  1  , NCOLS 

IF  ( ( BGT( I )  . NE .  1 0 )  . AND. (EDT( I  )  .NE.  1 0) )  GOTO  140 
CALL  PPNT( IBUF3 .NROWS  ,MARG-1+I,0) 

CONTINUE 


C 


1  50 

C 

C 

C 

1  10 


CALL  O  V  LO  D ( 4 ,  DA  ,l,IER) 

CALL  DECALG < NROWS .NCOLS , I VCL . I A R , I  UR , I OR , I ER , G A P . TBD , 
IBD ,  IBUF5  ) 

CALL  CLOSE (4,1 ER ) 

TYPE  " DEC A  LG  DONE" 

;  PAINTS 

DO  150  1*1, NROWS  ;  DECALG 

CALL  GROW(  IBUF5  ,  I  ,  1  ,  NCOLS  ,  I  ARRAY  )  ;  It  2  LFT. 

CALL  PROW( IBUF3 , (NROWS  ) + I , M ARC , NCO LS , I ARR A Y ) 
CONTINUE 

TYPE  "OUTPUT  DECALG  DONE" 

THE  FOLLOWING  SAVES  BUFFER  IBUF3  ON  DISK  AND  RELEASES 
OTHER  BUFFERS. 

WRITE  ( KTTYOUT ,110) 

FORMAT  ("OUTPUT  PICTURE  FILE  NAME-",Z) 

READ  (KTTYIN.20)  I NAME (  1  ) 

CALL  PICOUT  ( IBUF3, INAME, IHDR) 

CALL  RELB( IBUF  ) 

CALL  RELB(IBUFl) 

CALL  RELB (  I  BUF  3  ) 

CALL  RELB ( I BUF  5 ) 

END 
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£***************** 

C  SUBROUTINE  NAME:  VERTICAL  PROJECTION  (VRTPROJ.FR) 

C  WRITTEN  BY:  1 LT •  DAVID  V.  SOBOTA 

C  PURPOSE:  THIS  PROGRAM  WILL  DISPLAY  A  PROJECTION  OF  THE  It 

C  It  OF  PIXELS  IN  A  COLUMN  ON  THE  HORIZONTAL  AXIS  TO  PRODUCE 
C  VPROJ.  THE  VPROJ  ARRAY  IS  PROCESSED  TO  DETERMINE 
C  POSSIBLE  GAPS  OR  SPACES  BETWEEN  LETTERS. 

******************************************** 


OVERLAY  VP 

SUBROUTINE  VRTPROJ( IBUFI .NROWS , NCOLS , VPROJ .GAP , IBUFOl 
I  BUF02  ) 

PARAMETER  NBUFSZ-300 
PARAMETER  NUMCL-256 
INTEGER  IBUFI (NBUFSZ) 

INTEGER  IBUFOl (NBUFSZ) 

INTEGER  IBUF02(NBUFSZ) 

INTEGER  ICOL,  IROW 
INTEGER  NCOLS.  NROWS 
INTEGER  VPROJ(NUMCL) 

INTEGER  IARRAY(NUMCL) 

INTEGER  IARRYl (NUMCL) 

INTEGER  IARRY2(NUMCL) 

INTEGER  GAP ( NUMCL ) 


DO  30  ICOL-1,256 
V  PRO J (  I COL ) * 0 
CONTINUE 


;  ZEROS  VPROJ  ARRAY 


THIS  LOOP  COUNTS  NUMBERS  OF  DARK  P I XE LS / COL UMN 
DO  35  IROW-1, NROWS 

CALL  GROW ( IBUFI .IROW, l , NCOLS , I ARRAY) 

DO  40  ICOL- 1 , NCOLS 

IF  ( IARRAYC ICOL) .EQ.O)  VPROJ(ICOL)- 
VPROJ  (  ICOD  +  l 
CONTINUE 
CONTINUE 

THIS  LOOP  PRODUCES  THE  GAP  ARRAY  WHICH  GIVE  THREE 
LEVELS  OF  CONFIDENCE  AS  TO  WHETHER  THERE  IS  A  SPACE 
BETWEEN  THE  LETTERS  AT  A  PARTICULAR  COLUMN  LOCATION. 

DO  60  K- 1 , NCOLS 
GAP ( K ) “0 

IF  ( (VPROJ(K) . EQ. 1 ) .AND. ( (VPROJ(K+3) .GE. 5 ) .OR. 

( VPROJ(K-3 ) .GE . 5 ) ) )  GAP(K)-5 

IF  ( ( VPROJ(K) . EQ. 0) . AND . ( ( VPROJ(K+3 ) .GE . 5 ) . OR . 
<VPROJ(K-3) .GE.  5)  )  )  GAP(K)-10 

IF  ( ( VPROJ (K) .EQ.O) .AND. ( (VPROJ(K-l ) . LE. 1 ) .OR. 

( VPROJ ( K+ l ) . LE . 1 ) ) )  GAP(K)-l 5 


u  u 


CONTINUE 


THE  FOLLOWING  PUTS  THE  VPROJ  DATA  IN  IBUFOl  FOR  DISPLAY 
THE  FOLLOWING  PUTS  THE  GAP  DATA  IN  IBUF02  FOR  DISPLAY 

DO  50  IROW- 1 , NROWS 
DO  55  ICOL-l, NCOLS 

IARRY1 ( I COL  >“15 
IARRY2( ICOL)- 1 5 

IF  (VPROJ( ICOL) .GE. IROW)  I ARR Y 1 ( I COL ) - 0 
IF  (GAP( ICOL) .GE.  IROW)  I ARR Y 2 ( I CO L ) - 0 

CONTINUE 

CALL  PROW( IBUFOl , NROWS - I R0W+ 1 , 1 , NCOLS , I ARR Y 1 ) 

CALL  PROW ( IBUF0  2 , N RO WS - I ROW+ 1  .  1  , NCOLS  ,  I ARR Y 2 ) 
CONTINUE 

RETURN 

END 


v  . 
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Q  *********************************************************** 

SUBROUTINE  NAME:  DETECT  VERTICAL  CONTINUOUS  LINES 

(DVCLIN.FR) 

WRITTEN  BY:  1  LT  .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  SUBROUTINE  FINDS  SIGNIFICANT  VERTICAL 

CONTINUOUS  LINES  (VCL'S)  BY  LOOKING  AT  EACH  COLUMN  IN  THE 
BUFFER  CONTAINING  THE  WORD  AND  COUNTS  THE  NUMBER  OF 
CONTINUOUS  DARK  PIXELS  IN  A  COLUMN  UNTIL  IT  SEES  A  GAP  OF 
2  WHITE  PIXELS.  IT  PERFORMS  THIS  COUNT  STARTING  FROM  THE 
BOTTOM  AND  THE  TOP  OF  THE  WORD  AND  CHOSES  THE  BIGGEST  VCL 
TO  ACCOUNT  FOR  LETTERS  LIKE  "J"  AND  "I". 
************************************************************ 


OVERLAY  DV 

SUBROUTINE  DVCLIN(IBUFIN»NROWS, NCOLS, VC LD.VCLU, VCL, 
/  IBUFOUT) 


INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 


INCR (256) 

VCL(  2  56  )  , VC LD 
IGAP(256) 

I ARRAY  1  (  2  56  ) 
IARRAY2<256) 

I ARR Y  1  (256)  , 
JARRAY (256) 
START (256) 

I BUF I N ( 3 00  ) 
IBUFOUT( 300  ) 
NROWS , NCOLS 


{256),  VCLU (256) 


I ARR Y2 (256) 


THIS  CODE  SCANS  EACH  COLUMN  FROM  TOP  TO  BOTTOM  TO  FIND 
VCLD  ARRAY 


DO  10  I COL«  1 
VCL (  ICOL)-0 
VCLD( ICOL)-0 
I G AP ( I COL ) *  0 
I  NCR ( ICOL)- 1 
START ( I COL ) ■ 
0  CONTINUE 


, NCOLS 


0 


;  THIS  LOOP  INIT.  COUNT  PARAMETERS 
LARGEST  VCL  ARRAY 
DOWNWARD  VCL’S  ARRAY 

STORES  COUNT  OF  GAPS  OF  DARK  PIXELS 
INCREMENT  FLAG 

START  COUNT  OF  DARK  PIXELS  FLAG 


DO  20  IROW-1 .NROWS-l 

CALL  GROW( I BUFIN ,  I R0W+ 1 , 1 , NCOLS , I ARR A Y 1 ) 

CALL  GROW ( I BUF I N ,  IROW  , 1 , NCOLS , I ARR A Y 2 ) 

DO  30  ICOL- 1 , NCOLS 
IF  ( .NOT. ((IARRAYI(ICOL).EQ.O).AND. 

/  ( IARRAY2( ICOL) .EQ.O) ) )  GOTO  40 

START( ICOL) •  1 

VCLD( ICOL)-VCLD( I  COL )  + 1 NCR ( I  COL ) 

GOTO  50 

40  CONTINUE 

IF  (  STARTC ICOL) . EQ .  0)  GOTO  30 
IGAP(ICOL)-IGAP(ICOL)+l 


n  n 


V _V Jf.’V-  JWJWWJ vj i r; srr.’vr+n ‘ TOrarmrrwms » . otot mjuik '_*i iwuwwin innn'*-sm»* m 


IF  (  IGAP( ICOL) . GE. 2  )  INCR(ICOL)-0 

GOTO  30 

50  CONTINUE 

30  CONTINUE 

20  CONTINUE 

C 

C  THIS  CODE  PERFORMS  THE  SAME  VCL  COUNT  AS  ABOVE  BUT 

C  FROM  OPPOSITE  THE  DIRECTION 

C 

DO  55  1-1, NCOLS 

VCLU ( I ) *0  ;  UPWARD  VCL  ARRAY  IS  ZEROED 

I  NCR ( I  )- 1 
IGAP ( I )-0 
START ( I ) *0 
55  CONTINUE 

C 

DO  60  IROW-1 .NR0WS-1 

CALL  GROW ( IBUFIN .NR0WS-IR0W+1  , 1 .NCOLS , I ARRAY  1 ) 
CALL  GROW< IBUFIN , NR0WS- IROW  , 1 , NCOLS , I ARR A Y 2 ) 
DO  70  ICOL-1, NCOLS 

IF  (  . NOT. <  < I ARRAY l ( I COL ) . EQ . 0 ) . AND . 

/  ( IARRAY2( ICOL) . EQ . 0  )  )  )  GOTO  80 

START ( ICOL)  -  1 

VCLU( ICOL)- VCLU< ICOL )+INCR( ICOL) 

GOTO  90 

80  CONTINUE 

IF  (  START(ICOL) .EQ.  0)  GOTO  70 

IGAP( IC0L)-IGAP( ICO  L  )  ■<■  1 

IF  (  IGAP( ICOL) .GE . 2  )  INCR(IC0L)-0 

GOTO  70 

90  CONTINUE 

70  CONTINUE 

60  CONTINUE 

THIS  LOOP  COMPARES  EACH  COL.  OF  VCLU  AND  VCLD  AND 

C  PICKS  THE  BIGGEST  AND  STORES  RESULTS  IN  THE  VCL  ARRAY 

C 

DO  100  I- 1 , NCOLS 

VC  L ( I ) -VCLU( I ) 

IF  (VCLU(I).GT.  VCLD(I))  GOTO  100 
VC  L ( I ) - VCLD ( I ) 

100  CONTINUE 

C 

C  PUT  VCL'S  IN  NEW  BUFFER  IBUFOUT  FOR  DISPLAY 

C 

DO  110  IROW-1, NROWS 
DO  115  ICOL-1 .NCOLS 

I ARR AY  2 ( ICOL)-l 5 

IF  (VCL( ICOL) .GE. IROW)  I  ARR A Y 2 ( I  CO L ) -0 


67 


’.VwV. 

-  -*  j&j 


1  l  5 


CONTINUE 


CALL  PROW ( IBUFOUT , NROWS - I ROW+ 1  ,  1  , NCOLS , I  ARRAY  2  ) 
CONTINUE 

SMOOTHS  ORIGINAL  VCL  TO  FILL  IN  SMALL  VALLEYS  WHICH 
PRODUCE  ERRORS  IN  FINDING  THE  EDGES  OF  SIGNIFICANT 
VCL'S  (EDVCL.FR) 

DO  140  IROW- 1 , NROWS- 1 

CALL  GROW (IBUFOUT, IROW,  1 , NCOLS , I ARR Y I  ) 

CALL  GROW ( IBUFOUT, IROW+1  , 1  .NCOLS , I ARR Y 2 ) 

DO  150  J- 1 , NCOLS 

IF  ( ( I ARR  Y  2 ( J ) . EQ . 0 ) . AND . ( ( I ARR Y 1 ( J - 1 ) . EQ . 0 ) . OR . 

( IARRY 1 ( J+l ) . EQ . 0 )  )  )  I ARR Y 1 ( J ) “0 
CONTINUE 

CALL  PROW ( IBUFOUT, I  ROW  ,  1  , NCOLS , IARRY 1  ) 

CONTINUE 


RETURN 


*  *  *  *  * 


*  * 


*  * 


K», 


c ************ * 

C  SUBROUTINE  NAME:  HORIZONTAL  PROJECTION  (HRZPROJ.FR) 

C  WRITTEN  BY:  1 LT .  DAVID  V.SOBOTA 

C  PURPOSE:  THIS  ROUTINE  COUNTS  THE  NUMBER  OF  DARK  PIXELS  IN 
C  EACH  ROW  AND  PROJECTS  THE  NUMBER  ON  THE  VERTICAL  AXIS. 

C  THE  RESULTS  ARE  USED  IN  FDLHT.FR  TO  FIND  THE  HEIGHTS  OF 
C  VARIOUS  LETTERS. 


C*  *  * 


OVERLAY  HP 

SUBROUTINE  HRZPROJ(IBUFl, NROWS , NCOLS , HPROJ , IBUF2  ) 
PARAMETER  NUMROWS-256 
INTEGER  HPROJ(NUMROWS ) 

INTEGER  I ARRAY  1  (  2  36) 

INTEGER  I ARRAY  2 (256) 

DO  10  IROW-1, NROWS  ;  ZEROS  PROJECTION  OF  PIXELS/ROW 

HPRO J ( IROW)-0 

CONTINUE 

DO  20  IROW-1, NROWS 

CALL  GROW( IBUF1  , I ROW,  1  , NCOLS , I ARRAY  1 ) 

DO  30  ICOL- 1 , NCOLS 

IF  ( IARRAY 1 ( ICOL) .EQ.  0)  HPROJ(IROW)- 
HPRO J ( I  ROW )  +  1 
CONTINUE 
CONTINUE 

THE  FOLLOWING  PUTS  HPROJ  IN  IBUF2  FOR  DISPLAY 

DO  AO  IROW-1, NROWS 
DO  50  ICOL- 1 , NCOLS 

IARRAY2 ( ICOL)-  1  5 

IF  <HPROJ< IROW) .GE.  ICOL)  I ARRA Y 2 ( I COL ) - 0 

CONTINUE 

CALL  PROW( IBUF2 , IROW, 1 .NCOLS , I  ARRAY 2 ) 

CONTINUE 


RETURN 


;*,vvvvvr' 


SUBROUTINE  NAME:  FIND  LETTER  HEIGHT  (FDLHT.FR) 

WRITTEN  BY  :  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  PROGRAM  USES  HPROJ  PASSED  BY  HRZPROJ.FR  TO 
FIND  FOUR  ASSOCIATED  HEIGHTS  IN  SMALL  LETTERS : TT ,  TS,  BS , 
AND  VBS .  THESE  DIFFERENT  HEIGHTS  ARE  EXPLAINED  BELOW.  THESE 
HEIGHTS  ARE  USED  AS  CLUES  TO  DETERMINE  LOCATION  OF  POSSIBLE 
LETTERS . 


OVERLAY  FH 

SUBROUTINE  FD LHT ( NROWS , H PRO J , TT , TS , B S , V BS  ) 

PARAMETER  NUMROWS-256 
INTEGER  HPROJ ( NUMROWS ) ,  DTHR 

INTEGER  I,  IROW,  NROWS,  TPIX,  NAVPX,  TT ,  TS ,  BS,  VBS 

FIRST  FIND  AVE.  »  OF  PIXELS/ROW  TO  DETERMINE  THRESHOLD 

TPIX-0  ;  COUNT  OF  TOTAL  NUMBER  OF  PIXELS  IN  WORD 
N-0  ;  COUNTS  It  OF  ROWS  W/  SIGNIFICANT  It  PIXELS 

DO  1 0  IROW- 1 , NROWS 

TF  (HPROJ< IROW) . LT.  2)  GOTO  15 

TPIX-HPROJC I  ROW )+TP I X 

N-N+l 

CONTINUE 

CONTINUE 

NAVPX-INT(TPIX/N)  ;  NAVPX  -  AVE.  It  OF  PIXELS/ROW 

TYPE  "TPIX-",  TPIX 
TYPE  " N - " ,  N 
TYPE  "NAVPX-",  NAVPX 
TYPE  "NROWS-",  NROWS 

TT  -  TOP  OF  TALL  LETTERS  (  h , f , 1 , b , d , ETC .  ) 

TS  -  TOP  OF  SHORT  LETTERS  (  o , a , c  ,  e , p , g , ETC .  ) 

BS  -  BOTTOM  OF  SHORT  LETTERS  (  o , a , c , e , ETC .  ) 

VBS  -  VERY  BOTTOM  OF  SHORT  LETTERS  (  g , p , q , j , ETC .  ) 

THIS  CODE  STARTS  FROM  THE  TOP  OF  HPROJ  TO  DETERMINE 
WHEN  A  SIGNIFICANT  It  OF  PIXELS  IS  ENCOUNTER  AND  LESS 
NAVPX  TO  FIND  TT  AND  GREATER  THAN  NAVPX  TO  DETERMINE  TS 

TT-0  ;  THESE  VALUES  ARE  INITIALIZED 

TS-0 

BS-0 

VBS  -  0 

DO  30  1*1, NROWS 

IF  ( . NOT . ( <  HPROJ ( I ) . LT .  HPRO J ( 1+ l ) ) . AND . 

/  (  HPROJ  (I-M).GE.  1  )  .AND.  (HPROJ(  1  +  2)  ,GT.  1)))  GOTO  35 

IF  (  TT  .EQ.  0  )  TT - I + l 

CONTINUE 

IF  ( . NOT . (  (  HPROJ ( I )  .LT.  HPROJ(I+l))  .AND. 

/  (  HPROJ ( 1+ 1 ) . GT .  NAVPX  ) . A ND . ( TT . N E . 0 )  ))  GOTO  30 


i  t 


i  i 


IF  (  TS  .EQ.  0  )  TS-I 
IF  ( (TS-TT)  . LE . 3)  TS-TT 
IF  (TS.NE.O)  GOTO  40 
CONTINUE 
CONTINUE 

THIS  CODE  STARTS  FROM  THE  BOTTOM  OF  HPROJ  TO  DETERMINE 
WHEN  A  SIGNIFICANT  #  OF  PIXELS  IS  ENCOUNTER  AND  LESS 
NAVPX  TO  FIND  VBS  AND  GREATER  THAN  NAVPX  TO  DETERMINE  BS 

DO  45  1-1, NROWS 

IF  ( . NOT . ( (  HPROJ(NROWS-I+l ) .LT.  HPRO J < NROWS- I ) ) . AND . 
/  (  HPROJ(NROWS-I) .GE.  1 ) . AND . ( H PRO J ( NROWS - I - 1 ) . GT .  1) 

/  ))  GOTO  50 

IF  (VBS.EQ.O)  VBS-NROWS-I 
CONTINUE 

IF  (  .  NOT . ( (  HPR0J(NR0WS-I+1 ) .LT.  HPRO J ( NROWS- I )). AND . 
/  (  HPROJ(NROWS-I ) .GT.  N A V PX ) . AND . ( V BS . NE . 0 ) > )  GOTO  45 

IF  (  BS  .EQ.  0  )  B  S "NROWS - 1  + 1 

DTHR  ■ ( BS -TS ) / 3 
IF  ( ( VBS-BS) . LE. DTHR)  BS-VBS 
IF  (BS.NE.O)  GOTO  55 
CONTINUE 
CONTINUE 

IF  ( (TS-TT) .LT. DTHR)  TS-TT 

RETURN 

END 


k  r= 

I  l 


c************************************************************ 

C  SUBROUTINE  NAME:  EDGES  OF  VERTICAL  CO NT.  LINES  (EDVCL.FR) 
WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  SUBROUTINE  TAKES  THE  RESULTS  OF  DVCLIN.FR 
AND  FINDS  THE  SIGNIFICANT  EDGES  OF  THE  VCL'S  BY  FINDING 
VCL'S  WHOSE  HEIGHT  IS  AT  LEAST  EQUAL  TO  1/2(BS  -TS). 


******* 


OVERLAY  EV 

SUBROUTINE  ED VCL ( I B U F I N , NROWS , NCOLS , ROW , E VC , TS , BS , 

BG VC, ED VC, I VCL, IBUFOUT) 

INTEGER  NCOLS, NROWS , N , N B I G , NTMP , SUM 
INTEGER  IBUFIN( 300) .IBUFOUT (300) , IARRY1 ( 256) 

INTEGER  WV  C (  2  5  6  ) ,  STCL(  2  56),  IARRY2(256) 

INTEGER  R WVC ( 256),  RSTCL(256) 

INTEGER  VCL ( 256) 

INTEGER  EVC(256) ,  PEDGI,  PEDG2 
INTEGER  ROW 
INTEGER  NSEG (256) 

REAL  AVG,  STDTMP ,  SDSEG(256),  STDSML 

INTEGER  BGVC(256) ,EDVC(256) 

INTEGER  TS.BS.MID, IVCL(20, 10) 

NBIG-0 

THIS  LOOP  LOOKS  AT  EACH  ROW  OF  IBUFIN  PASSED  BY 
DVCLIN.FR  AND  STORES  THE  STARTING  POSITION  AND  WIDTH 
OF  EACH  VCL  ENCOUNTERED  IN  THE  ROW. 

DO  20  1-3, NROWS  ;  INITIALIZED  PARAMETERS 

DO  10  N- l , ( NCOLS / 2 ) 

WVC ( N ) -0  ;  WIDTH  VCL’S  ARRAY 

STCL ( N ) -  1  ;  STARTING  COL.  POSITION 

CONTINUE 

N-l  ;  INITIALIZED  ft  OF  VCL'S  COUNT 

FLG-0 

CALL  GROW( IBUFIN , NROWS -1+1 , 1 , NCOLS , I ARR Y 1 ) 

DO  30  J-l, NCOLS 

IF  ( (FLG. EQ . 0 ) . AND. ( IARRY 1 ( J ) . EQ. 0 ) )  STCL(N)«J 
IF  (IARRYl(J) . EQ . 0 )  GOTO  35 
GOTO  40 


CONTINUE 


CONTINUE 

] 

GOTO  30 
CONTINUE 


WVC(N)-WVC(N)+1 
FLG-  1 

IF  ( ( I ARRY 1 ( J ) . EQ . 1 5 ) . AND. (FLG . EQ . 1 ) )  GOTO  45 


N-N+  l 
FLG-0 


CONTINUE 


W-V-T  -  WWirj.  iT-J  rf'-f  >r  - 


NTMP-N-1  ; NTMP  IS  NUMBER  OF  SEGMENTS  IN  GIVEN  ROW 

NSEG(I)-NTMP  ; STORES  NTMP  IN  EACH  ROW  IN  N  S  EG 
IF  (NTMP  .GT.  NBIG)  NBIG-NTMP  ;FINDS  BIGGEST  NTMP 
C 

SUM-0 

DO  70  K-l.NTMP 

SUM-SUM+WVCOO 

70  CONTINUE 

AVG-(SUM/ NTMP )  ;FINDS  AVE .  WIDTH  OF  SEGMENTS 
STDTMP-0 . 0 
DO  80  K- 1 , NTMP 

STDTMP-STDTMP+ABS ( AVG-WVC(K)) 

80  CONTINUE 

S  DS  EG ( I )«(STDTMP/NTMP) 

20  CONTINUE 

C 

C  THIS  LOOP  PICKS  THE  ROW  WITH  NEAR  GREATEST  NUMBER  OF 

C  SEGMENTS  AND  SMALLEST  STD.  FOR  ITS  WIDTHS  AND  FINDS 

C  THE  EDGES  AGAIN  TO  REPRESENT  THE  EDGES  OF  THE  SIGNIFICANT 

C  VCL'S.  THE  EDGES  ARE  THEN  STORED  IN  BGVC  AND  EDVC. 

C 

STDSML-500 
DO  120  I-3.NROWS 

IF  ( (NSEG(I)  .EQ.  NBIG) .OR . (NSEG( I  )  . EQ.  (NBIG-1  )  ) ) 

/  GOTO  130 
GOTO  120 
130  CONTINUE 

C  TYPE  I,  "  NSEG-",  NSEG(I),  "  SDSEG-",  SDSEG(I) 

IF  ( S  DSEG ( I ) . LE .  STDSML )  GOTO  140 
GOTO  120 
140  CONTINUE 

STDSML-SDSEGC I  ) 

ROW-NROWS-I+1 
120  CONTINUE 

C 

DO  125  K-  1  , NCOLS 
W VC ( K ) -0 
STCL  (  K  )  «■  1 
125  CONTINUE 

N-  1 
FLG-0 

MID-NROWS- ( BS-TS ) / 2  ;  PREVIOUS  CODE  NULLED 

ROW-MID  ;  ****** 

CALL  GROW( IBUFIN.ROW  , 1  , NCO LS , I AR R Y 1 )  ;  *  *  *  *  * 

DO  150  J-l, NCOLS 

IF  ( (FLG . EQ . 0 ) . AND . ( IARRY 1 ( J) . EQ . 0  )  )  STCL(N)«J 
IF  (IARRYl(J)  .  EQ  .  0  )  GOTO  155 

GOTO  160 

155  CONTINUE 

WVC(N)-WVC(N)+1 
FLG-  1 

160  CONTINUE 

IF  ( ( I ARRY 1 ( J ) . EQ . 1 5 ) . AND . ( FLG . EQ . 1 ) )  GOTO  165 


73 


% 


GOTO  150 
CONTINUE 


N-N+l 

FLG-0 

CONTINUE 

NUMSEG-N-1 

DO  90  K- 1 , NUMSEG 

RWVC(K)-WVC(K) 

RSTCL(K)-STCL(K) 

CONTINUE 

DO  95  1-1, NUMSEG  ;* 

TYPE  I,  "  W",  RWVC(I),  "  STC",  RSTCL(I)  ; * 
CONTINUE  ;* 

TYPE  "STDSML",  STDSML,  "  NTMP",  NTMP  NBIG",  NBIG 

TYPE  "ROW-  ",  ROW  ; * 

DO  100  I-l.NCOLS 
E VC ( I ) -0 
BG VC ( I ) -0 
EDVC ( I ) -0 

CONTINUE 

EDGES  OF  SIGNIFICANT  VCL'S  IS  THEN  STORED  IN  IVCL  ARRAY 
DO  200  J -  1 ,20 
DO  210  K—  1  ,  10 
IVCL( J,K)-0 
CONTINUE 
CONTINUE 

DO  110  1-1 .NUMSEG 
PEDG 1  -RSTCL( I )-  l 
PEDG2-RSTCL( I ) +RW VC ( I  ) 

EVC( PEDG 1 ) -  1 0  ;  STORES  EDGES  OF  VCL’S  IN  EVC 

EVC(PEDG2)-10 

BGVC(PEDG1)-10  ;  STORES  BEGINNING  OF  VCL'S  IN  BGVC 
EDVC(PEDG2)-10  ;  STORES  END  OF  VCL'S  IN  EDVC 
I VCL ( I , 1 ) -PEDG 1 
I VCL ( I , 2 ) -PEDG2 
CONTINUE 


RESULTS  ARE  PUT  IN  IBUFOUT  FOR  DISPLAY 

DO  250  I - 1 , NROWS 
DO  260  J- 1 , NCOLS 

IARRY2 ( J) -  1 5 

IF  (EVC(J) .GE. I)  I ARRY  2 ( J ) -0 

CONTINUE 

CALL  PROW ( IBUFOUT .NROWS- 1+1 ,1 ,NCOLS,IARRY2> 
CONTINUE 


TYPE  " ED VCL  DONE" 
RETURN 


iF 


r*********************************************************** 

SUBROUTINE  NAME:  FIND  HEIGHT  OF  VCL ' S  (HTVCL.FR) 

WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  ROUTINE  FINDS  THE  MAX.  AND  MIN.  HEIGHTS  OF 
CONTINUOUS  DARK  PIXELS  IN  THE  FIRST  HALF  AND  SECOND  HALF 
OF  EACH  VCL  REGION  DETECTED  IN  EDVCL.FR.  THIS  INFORMATION 
IS  USED  TO  DETECTED  T'S  AND  DISTINQUISH  I'S  FROM  J'S  AND 
ALSO  TO  GIVE  SEGMENTATION  CLUES  IN  DECALG.FR. 

i************* 


OVERLAY  HT 

SUBROUTINE  HTVCL(IBUFI,BGVC,EDVC, NROWS , NCOLS , MXHT 1 , 
MXHT2 , MNHT 1 , MNHT2 ,BS,TS,IVCL) 

INTEGER  I BUF 1(300) 

INTEGER  NROWS , NCOLS , START , END 
INTEGER  IARRY 1(128),  IARRY2(128) 

INTEGER  STFLG( L 28 ) , STZD( 1 28 ) 

INTEGER  STZU(  128)  ,  NBZ 
INTEGER  BGVC( 1 28 ) , EDVC( 1 28 ) 

INTEGER  MXROW ,  MNROW,  MNHT1(128),  MXHT1(128) 

INTEGER  MNHT2 ( 1 28 ) , MXHT2 ( L 28 ) ,  MDD 
INTEGER  N  ,  BS  ,  TS ,  I  VCL( 20 , 1 0 ) 


DO  10  1*1, NCOLS 
STFLG ( I ) -0 
STZD ( I ) *0 
MNHT 1 ( I ) “ 0 
MXHT 1 ( I  )  * 0 
MNHT2 ( I ) -0 
MXHT  2 ( I ) *  0 
CONTINUE 


INITIALIZE  PARAMETERS 


THIS  LOOP  FINDS  THE  BEGINNING  AND  END  OF  EACH  VCL  AND 
THE  MIDDLE  OF  EACH  VCL. 


DO  15  K- 1 , NCOLS 

IF  (BGVC(K) .EQ. 10)  START-K 

IF  <EDVC(K) .NE. 10)  GOTO  15 

END-K 

MXROW*  2  5  6 

MNROW- 1 

MDD-INT( ( END-START+1 ) /2 )  ;  MIDDLE  OF  VCL 

THIS  LOOP  FINDS  THE  MAX.  FOR  THE  FIRST  HALF  OF  THE  VCL 
DO  20  IROW- 1 , NROWS- 1 

CALL  GROW (  I BUF I  ,  IROW  ,  1  , NCO LS , I ARR Y 1  ) 

CALL  GROW (  I BUF  I  ,  I ROW+ 1  ,  1  , NCOLS ,  I  ARR Y 2 ) 

DO  30  ICOL-START , ( START+MDD- 1 ) ;  1ST  HALF  VCL 
NBZ-0 

IF  (( IARRY 1 ( ICOL- 1 ). EQ . 0 ). OR . 


v-vv 


/  ( IARRY1 ( ICOL+1 ) . EQ.O) )  NBZ-1 

IF  ( STFLG (ICOL).EQ.l)  GOTO  30 
IF  (( IARRY 1 ( ICOL ). EQ . 0 ). AND . 

/  ( ( I ARRY2< ICOL) .EQ . 0 ) . OR . (NBZ . EQ . 1 ) ) )  GOTO  35 

GOTO  30 

CONTINUE 

STFLG ( ICOL)- 1 
STZD ( ICOL)-IROW 

IF  ( STZD( ICOL)  . LT . MXROW)  MXROW- STZD < I  COL ) 

CONTINUE 

CONTINUE 

MXHT1 (START)-MXROW  ;  MAX.  HEIGHT  PUT  START  POSITION 

DO  37  1  =  1, NCO  LS 
STFLG ( I ) =0 
STZU  < I ) -0 
CONTINUE 


THIS  LOOP  FINDS  THE  MIN.  FOR  THE  FIRST  HALF  OF  THE  VCL. 


/ 

/ 


DO  40  IROW* 1 , NROWS- 1 

CALL  GROW( I BUFI , NROWS - I ROW+ 1  ,  1  ,NCOLS ,  IARRY  1  ) 
CALL  GROW( I BUF I , NROWS- IROW  , 1 , NCO LS , I ARR Y 2 ) 
DO  50  ICOL«START,(START+MDD-l ) 

NBZ-0 

IF  (( IARRY 1 ( ICOL- 1 ). EQ . 0) .OR . 

( IARRY l ( ICOL+1 ). EQ . 0) )  NBZ-l 
IF  ( STFLG( ICOL) . EQ . 1 )  GOTO  50 
IF  <(  IARRY 1 <  ICOL) . EQ . 0  ). AND . 

( ( I ARRY2 ( ICOL)  . EQ . 0  )  . OR .  (NBZ . EQ .  1 ) ) )  GOTO  55 
GOTO  50 

CONTINUE 

STFLG ( ICOL)-  1 

STZII  (  ICOL) -NROWS  -  IROW 

IF  (STZU( ICOL) .GT.MNROW)  MNROW- STZU ( I COL ) 

CONTINUE 

CONTINUE 


MNHT1 ( START ) -MNROW  ;MIN.  HEIGHT  PUT  IN  START  POSTION 

NUMBERS  ARE  STORED  IN  IVCL(N,3)  ACCORDING  TO  DIFFERENT 
HEIGHTS 

IF  ( ( MXROW. GE . (TS-2 )). AND . (MNROW. LE .( BS+2 )) ) 

/  IVCL(N,3)-1 

IF  ( (MXROW . LE. (TS-2)). AND. (MNROW. LE. (BS+2))) 

/  IVCL(N,3)-2 

IF  ( ( MXROW. GE. (TS-2)). AND. ( MNROW. GE. (BS+2))) 

/  IVCL(N,3)-3 

I  F  ( (MXROW. LE . (TS-2)). AND. ( MNROW. GE. (BS  +  2))) 

/  I  VCL( N , 3 ) -4 


MXROW- 256 


t  C 


MNROW  - 1 

THIS  LOOP  FINDS  THE  MAX.  HEIGHT  FOR  THE  SECOND  HALF 
OF  THE  VCL 


DO  60  IROW-1 .NROWS-l 

CALL  GRO  W( IBUFI  ,  IROW  ,  1  , NCOLS , I A  R  R  Y 1  ) 

CALL  GROW( I BUF I ,  I ROW+ 1 , 1 , NCOLS , I A RR Y 2 ) 

DO  70  ICOL- ( END-MDD+1 ) , END 
NBZ  *0 

IF  ( ( IARRY1 (  ICOL-1  )  .  EQ  .  0 )  .OR. 

/  ( IARRY 1 ( ICOL+l ). EQ . 0 ) )  NBZ-l 

IF  ( STFLG( ICOL) . EQ . 1 )  GOTO  70 
IF  (( IARRY1 < ICOL) . EQ.O) .AND. 

/  < ( I ARRY2 ( ICOL) . EQ. 0) . OR . (NBZ . EQ. 1 ) ) )  GOTO  75 

GOTO  70 

CONTINUE 

STFLG ( IC0L)*1 
STZD ( ICOL)-IROW 

IF  ( STZD( ICOL) . LT.MXROW)  MXROW-STZD( ICOL ) 

CONTINUE 

CONTINUE 

MXHT2(END)-MXR0W  ; MAX .  HEIGHT  PUT  IN  END  POSITION 

DO  77  1-1 .NCOLS 

STFLG  < I  )  =0 
STZUC I )«0 
CONTINUE 

THIS  LOOP  FINDS  THE  MIN.  HT .  FOR  THE  2ND  HALF  OF  THE  VCL 
DO  80  IROW-1 .NROWS-l 

CALL  GROW  < IBUFI , NROWS-IROW+1  ,  1  .NCOLS . IARRY 1  ) 
CALL  GROW( IBUFI  .NROWS-IROW  ,  1  , NCOLS ,  I  ARR Y 2 ) 
DO  90  I COL- ( END-MDD+ 1 ) .END 
NBZ-0 

IF  (< IARRY1 < ICOL-1 ). EQ.O). OR. 

/  (  IARRY l ( ICOL+l ). EQ . 0) )  NBZ-l 

IF  ( STFLG( ICOL) . EQ . 1 )  GOTO  90 
IF  (( IARRY l ( ICOL) . EQ . 0) . AND . 

/  ( ( IARRY2 ( ICOL) . EQ . 0 ) .OR . (NBZ . EQ. 1 ) ) )  GOTO  95 

GOTO  90 


CONTINUE 


STFLG( ICOL) - 1 
STZU(ICOL)-NROWS-IROW 

IF  ( STZU( ICOL) .GT. MNROW)  MNROW- STZU ( I COL ) 


CONTINUE 

CONTINUE 


MNHT2(END)-MNR0W  ;  MIN.  HEIGHT  PUT  IN  END  POSITION 
NUMBERS  ARE  ASSIGNED  TO  IVCL(N,4)  ACCORDING  TO  VARIOUS 


77 


HEIGHT  CONDITIONS  FOR  2ND  HALF  OF  VCL 

IF  < (MXROW.GE . (TS-2 )). AND . (MNROW. LE. ( BS+2 )) ) 
/  I  VCL  <  N  ,  4  )  *  1 

IF  < (MXROW. LE . (TS-2) ) .AND. (MNROW. LE . (BS+2) ) ) 
/  IVCL(N,4)»2 

IF  ( (MXROW.GE . (TS-2 )). AND . (MNROW. GE .( BS+2 )) ) 
/  IVCL(N,4)«3 

IF  ( (MXROW. LE .(TS-2)). AND. ( MNROW. GE  .  ( BS  +  2 ) ) ) 
/  IVCL(N,4)«4 

N-N+l 
CONTINUE 

TYPE  "HTVCL  DONE" 

RETURN 


ti  £ 

«  c 


************************************************************ 
SUBROUTINE  NAME:  DETECT  O ' S , U ' S , ARCHES ,  &  E*S  (DTOUAE.FR) 
WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  ROUTINE  DETECTS  THE  PRESENCE  OF  LETTERS  BY 
FINDING  THE  REGIONS  OF  ZEROS(  LACK  OF  DARK  PIXELS  )  NEAR 
VCL’S  AND  HORIZONTAL  LINES. 

0  : DEF INED  AS  L I NE , S P ACE , L I NE  NEXT  TO  VCL'S  ( o , a , p , b , ETC . ) 
U  .-DEFINED  AS  SPACE  AND  LINE  NEXT  TO  VCL'S  (u.v.v.y,  ETC.) 
A  : DEF INED  AS  LINE  AND  SPACE  NEXT  TO  VCL'S  ( h , n , m , r , t  ) 

E  : DEFINED  AS  L I NE , S P ACE , L I NE , S P ACE , L I NE  (a,e,g,s,z  ) 

(  8-REGIONS  IN  THESIS  -  E-REGIONS  IN  SOFTWARE  ) 
************************************************************ 

OVERLAY  DT 

SUBROUTINE  DTOUAE ( I BUF I , NROWS , NCOLS , TS , BS , I BUFO 1 , 

/  IBUF02,XI0R,XIUR,XIUR,XIER,I0R,IUR,IAR,IER) 

INTEGER  IBUFI ( 300) 

INTEGER  I ARR Y 1(128),  IARRY2(128) 

INTEGER  STFLG  < 128),STZD(128),EDZD(128) 

INTEGER  STCTUC  L28),STZU<128),EDZU(128) 

INTEGER  I BUFO 1 ( 300 ) ,  IBUF02(300) 

INTEGER  STHL,ENHL,MAX,SFLG,  MNHT 

INTEGER  NZO(128),  NZU(I28),  NZA(128),  TS  ,  BS ,  PMTSBS 

:  INTEGER  RNZO< 128) ,RNZU( 128) ,RNZA( 128) 

INTEGER  BGO( 128) , BGU( 1 28) , BGA( 128) ,  BGE(I28) 

INTEGER  EDO( 1 28 ) , EDU( l 28 ) , EDA( 1 28  )  ,  EDE(128) 

INTEGER  XEDO< 1 28) ,XEDU( 128) ,XEDA( 1 28) ,  XEDE(128) 
INTEGER  XBGO( 128)  ,XBGU( 128)  ,XBGA( 128) ,  XBGE(128) 
INTEGER  STOU(128),  STOD(128) 

INTEGER  IOR(20,3),IUR(20,3),IAR(20,3),IER(20,3) 
INTEGER  XI OR (20, 3), XI  UR (20, 3), XI AR (20, 3), XI ER (20, 3) 

DO  10  1-1, NCOLS  ;  INITIALIZE  SCAN  DOWN  COL.  VAR. 


STFLG ( I ) -0 
STZD ( I ) *0 
EDZD ( I ) -0 
STOD ( I )«0 
CONTINUE 


FLAG  INDICATE  DARK  PIXELS  ENCOUNTERED 
INDICATES  ROW  WHITE  P I XELS ( ZEROS )  BEGIN 
INDICATES  ROW  WHITE  P I XE LS ( ZERO S )  END 


THIS  LOOP  SCANS  DOWN  WORD  TO  FIND  STZD  AND  EDZD  FOR  EACH 
COLUMN  IN  FOLLOWING  CODE:  WHITE  PIXEL-15,  BLACK  PIXEL-0 


I  ROW" 1 .NROWS- 1 

CALL  GROW( I BUF I ,  IROW  , 1 , NCOLS , I ARR Y 1 

CALL  GROW (IBUFI,  I ROW+ 1 , 1 , NCOLS , I ARR Y 2  ! 

DO  30  ICOL-1, NCOLS 
IF  (EDZD( ICOL) .NE . 0)  GOTO  30 
NBZ-0 

IF  ( ( IARRY1 ( ICOL- I  )  .  EQ. 0) . OR . 

(  IARRY 1 ( ICOL+1 ) . EQ . 0) )  NBZ-l  ;  PIXELS  ARE  1 
IF  ( ( IARRY 1 ( ICOL) . EQ . 0 ) . AND . ( ( I ARRY2 ( ICOL) 
•EQ.O).OR.(NBZ.EQ.l)))  STFLG ( I COL )- 1 


PIXELS 
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IF  ( . NOT . ( ( I ARRY 1 ( I COL) . EQ . 0 ) . AND . 

( ( IARRY2 ( ICOL) . EQ . 0) .OR . (NBZ . EQ.  1 ) )  )  )  GOTO  22 
IF  (STOD(ICOL)  . EQ  .  0  )  STOD ( I COL ) - I  ROW 
STFLG ( I COL ) ■ 1 
CONTINUE 

IF  ( < IARRY2 ( ICOL)  . EQ .  1 5  )  . AND . 

< IARRY 1 < ICOL) . EQ . 1 5 ) . AND . ( STFLG( ICOL) . EQ . I ) )  GOTO  25 

GOTO  27 

CONTINUE 

STZD  < I COL ) *  I  ROW 

STFLG ( I CO  L ) “ 0 

GOTO  30 

CONTINUE 

IF  (  ( IARRY 1 ( ICOL) . EQ . 0 ) . AND . ( ( I ARRY 2 ( I COL ) . EQ . 0 ) . OR . 
(NBZ.EQ.  1 )). AND .( STZD( ICOL)  .NE.O))  EDZD ( I CO L ) - I ROW- 1 
CONTINUE 
CONTINUE 

THIS  DOES  THE  SAME  THING  AS  THE  LOOP  ABOVE  BUT  IN  AN 
UPWARD  DIRECTION. 

DO  35  I-l, NCOLS 
STFLG ( I ) -0 
STZU ( I ) -0 
EDZU  < I )-0 
STOU( I >-0 
CONTINUE 

DO  40  IROW» 1 , NROWS- 1 

CALL  GROW( IBUFI , NROWS-IROW+1 , 1 , NCOLS , I ARRY 1  ) 

CALL  GROW( IBUFI .NROWS-IROW  , 1 , NCO LS , I ARR Y 2  ) 

DO  50  I COL* 1 .NCOLS 

IF  (EDZU( ICOL)  .NE.O)  GOTO  50 

NBZ-0 

IF  ( (IARRY I ( ICOL- 1 ) . EQ . 0 ) . OR . ( I ARR Y 1 ( I COL+ 1 ) 

. EQ . 0 ) )  NBZ-1 

IF  ( (IARRY1 (ICOL) .EQ.O) .AND. ( ( I ARRY 2 ( ICOL) 
.EQ.O) .OR. (NBZ.EQ. 1)))  STFLG ( ICOL )-l 
IF  ( .NOT. ( ( IARRY1 ( ICOL) . EQ.O) . AND . ( ( I ARRY 2 ( ICOL) 

. EQ . 0) .OR . (NBZ  .  EQ  .  1  ) ) ) )  GOTO  42 

IF  ( STOU( ICOL) . EQ. 0)  STOU ( I COL ) -NROWS - I ROW+ 1 

STFLG  (  ICOD-l 

CONTINUE 

IF  ( ( IARRY2 ( ICOL) . EQ .  I  5 ) . AND . 

( I ARRY 1 ( ICOL) . EQ . 1 5 ) . AND. ( STFLG( ICOL) . EQ . 1 ) )  GOTO  45 

GOTO  47 

CONTINUE 

STZU(  ICOD-NROWS-IROW  +  1 
STFLG  (  I  COL  )  ■» 0 
GOTO  50 
CONTINUE 

IF  (( IARRY1 (ICOL) .EQ.O) .AND. ((IARRY2(IC0L) .EQ.O) .OR. 
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/  (NBZ.EQ.  1  )  )  .AND. (STZU( ICOL)  .NE.O)) 

/  ED  Z  U ( IC0L)“NR0WS- I ROW+2 

CONTINUE 
CONTINUE 


DO  60  1-1, NCOLS 

NZ0( I ) -0 

NZU ( I )-0 

NZA ( I ) *0 

CONTINUE 


INITIALIZE  NUMBER  OF  ZEROS  ARRAYS 
INDICATES  NUMBER  OF  "0"  ZEROS 

INDICATES  NUMBER  OF  "U"  ZEROS 

INDICATES  NUMBER  OF  "A"  ZEROS 


PMTS B S-INT((TS+BS)/2)  ;  POSITION  MID.  TS  AND  BS 
DO  70  I-l, NCOLS  ;  TS-  TOP  OF  SMALL  LETTERS 

;  BS-  BOTTOM  OF  SMALL  LETTERS 
TYPE  I,  "  STZU "  ,  STZU(I),  "  STZD"  ,  STZD(I) 

IF  (STZU(I).GT.  STZD(I>)  NZO ( I ) - ( S TZU ( I ) - STZD ( I ) ) + 1 
IF  < . NOT .(STZD(I).GT.STZU(I)))  GOTO  70 
IF  <<STZD(I).LT.<  BS-2  ) ) . AND . < EDZD( I ) . EQ . 0 )  ) 

/  N  Z A ( I)-BS-STZD(I) 

IF  <<STOU(I).LE.<  BS-2  ) > . AND . ( STOU( I ) . GT . 0 > ) 

/  N  Z  A ( I ) “8S-2-STOU( I )  ;  STOD - ST ART I NG  DARK  PIXEL  DOWN 
IF  ( (STZU( I ) .GT. (  TS+2  > ) . AND . ( EDZU ( I ) . EQ . 0 ) ) 

/  NZU< I ) -STZU< I ) -TS 

IF  < <STOD< I ) .GE . (  TS+2  ) > . AND . ( STOD ( I ) . GT . 0 ) > 

/  NZU( I )-STOD( I )-<TS+2 )  ;  STOU- ST ART I NG  DARK  PIXEL  UP 

CONTINUE 


DELOUA  DELETES  ALL  NZO, NZU,  OR  NZA'S  WHICH  DO  NOT  HAVE 
A  MIN.  WIDTH  OR  A  MIN.  HEIGHT  . 

TO  WORK  IN  ALL  CASES, THESE  PARAMETERS  WOULD  HAVE  TO  BE 
SCALED  TO  LETTER  DIMENSIONS. 

MINWD-2  ,  MINHT-(BS-TS)/2 
MNHT-( BS-TS)/2 

CALL  DELOUA(NZO,NCOLS , 2 , MNHT , 3 , BGO , EDO) 

CALL  DELOUA ( NZA .NCOLS , 2 , MNHT, 3 , BGA, EDA) 

CALL  DELOUA (NZU .NCOLS , 2 , MNHT , 3 , BGU , EDU) 

DTEG  TAKES  ALL  O-REGIONS  AND  DETECTS  WHETHER  THEY  MEET 
THE  CONDITIONS  FOR  SIMPLE  E-REGIONS, OR  E-REGIONS  WHICH 
EXTEND  BELOW  BS. 

CALL  DTEG ( IBUFI , BGO , EDO , NROWS .NCOLS ,TS , BS , BGE , EDE , 

/  IOR.IER)  ;*** 

CALL  DTA( IBUFI , BGA , EDA .NROWS .NCOLS ,TS , BS , IAR)  ;  *  *  * 
CALL  DTU( IBUFI , BGU, EDU , NROWS , NCOLS ,TS , BS , IUR)  ; *  *  * 

C 

C  MINWD-50  ,  MINHT-0  ;  NO  THRESHOLD 

MNHT-0 

CALL  DELOUA (NZO, NCOLS , 20 , MNHT, 3 ,XBGO , XEDO) 

CALL  DELOUA(NZA, NCOLS, 20, MNHT, 3 , XBGA , XEDA ) 

CALL  DELOUA (NZU, NCOLS, 20, MNHT, 3, X BGU, X EDU) 


.  v .  •  *«  *- V*  V.  *.  • 
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DTEG  TAKES  ALL  O-REGIONS  AND  DETECTS  WHETHER  THEY  MEET 
THE  CONDITIONS  FOR  SIMPLE  E-REGIONS, OR  E-REGIONS  WHICH 
EXTEND  BELOW  BS. 

CALL  DTEG ( IBUFI , XBGO , XEDO , NROWS , NCOLS , TS , BS , XBGE , XEDE 
/  , X  I OR , X  I ER  )  ;*** 

CALL  DTA( IBUFI ,XBGA ,XEDA , NROWS .NCOLS ,TS , BS , XIAR)  ;*** 
CALL  DTU( IBUFI , XBGU , XEDU , NROWS , NCOLS ,TS , BS , XI  UR)  ;*** 
DO  110  I- 1 , NROWS 
DO  120  J«l, NCOLS 

I ARR Y 1  (  J  )  -  1  5 

IF  ( NZO ( J ) . GE . I  )  IARRYl(J)-0 

CONTINUE 

CALL  PROW( IBUFOl  , NROWS -1+1  , 5  *  2  +  1 2 0 , NCOLS , I ARR  Y 1  ) 
CONTINUE 

DO  130  1-1,20 
DO  140  J- 1 , NCOLS 

IARRY 1 <  J  )  -  1  5 

IF  < BGO< J) . GE  .  I  >  IARRY1 (J)-0 
IF  ( EDO (J).GE.I)  I ARR Y 1 ( J ) -0 
IF  < BGE( J) .GE . I )  IARRY1 (J)-0 
IF  (EDE( J) .GE . I )  I ARR  Y 1 ( J ) -0 

CONTINUE 

CALL  PROW ( IBUFOl  , NROWS  +  2 0- 1+ 1  ,  5 *  2  + 1 2 0 . NCOLS , IARRY 1 ) 
CONTINUE 

DO  150  1-1, NROWS 

DO  160  J-l, NCOLS 

IARRY 1 ( J  )  »  1  5 

IF  (NZA< J)  .GE.  I )  IARRY 1 ( J ) -0 

CONTINUE 

CALL  PROW ( IBUFOl  , 2  * NROWS  +  2 0 - 1+ 1  ,  5 * 2  + 1 2 0  , NCOLS , IARRY 1  ) 
CONTINUE 

DO  170  1-1,20 
DO  180  J-l, NCOLS 

IARRY1 < J)-l  5 

IF  (BGA(J) .GE. I)  I ARRY 1 ( J ) -0 
IF  (EDA( J) .GE. I )  IARRYl(J)-0 

CONTINUE 

CALL  PROW ( IBUFOl , 2 * NROWS+4 0 - 1+ 1 , 5*2+120, NCOLS , IARRY 1 ) 
CONTINUE 

DO  190  1-1 , NROWS 
DO  200  J-l , NCOLS 

IARRY 1 ( J)-l  5 

IF  (NZU(J).GE.I)  I ARRY  l  ( J  )  -0 

CONTINUE 

CALL  PROW ( IBUFOl, 3 * NROWS+4 0- I + 1 ,5*2+120, NCOLS , IARRY 1 > 
CONTINUE 
DO  210  1-1,20 
DO  220  J-l, NCOLS 


********** 


******** 


SUBROUTINE  NAME:  DELETE  O.U.AND  A  REGIONS  (DELOUA.FR) 
WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  ROUTINE  FINDS  THE  EDGES  OF  SIGNIFICANT  O.A.U 
REGIONS  BY  DELETING  0,U,E,  REGIONS  WHICH  DO  NOT  HAVE  A  MIN 
HEIGHT  OF  MINHT  OF  DO  NOT  MIN.  WIDTH  OF  MINWD.  THE  EDGES 
OF  THE  REMAINING  REGIONS  ARE  THEN  FOUND  &  GIVEN  A  WEIGHT 
OF  WT  FOR  DISPLAY  IN  ORDER  TO  DISTINQUISH  THE  DIFFERENT 
REGIONS  .  THIS  ROUTINE  IS  CALLED  FOR  0,A, AND  U  REGIONS. 

********** 


SUBROUTINE  DELOUA  (NZ, NCOLS, MINWD , MI NHT , WT , STCL.EDCL) 

INTEGER  MAX,  SFLG ,  START,  END  ,  NCOLS 
INTEGER  NZ( 128) ,  RNZ(L28),  MINHT,  MINWD 
INTEGER  FLG ,  STCL(128),  EDCL(128> 

INTEGER  WT 

MAX-0 

SFLG-0 

START-0 

END-0 

DO  100  1-1, NCOLS 

RN2 ( I ) “NZ (  I  ) 

IF  ( . NOT . ( ( SFLG. EQ. 0) .AND. (NZ(I).GE.l))) 

GOTO  110 
START- I 
SFLG-  l 


CONTINUE 


IF  <NZ(I) .GT.MAX)  MAX-NZ(I) 

IF  ( (SFLG .EQ.  1  )  .  AND. (NZ( I ) . EQ.O)  )  END-I-1 
IF  (END. EQ.O)  GOTO  100 
DO  120  K- START , END 

IF  ( ( (END -ST ART) . LE .MINWD) . OR . (MAX . LE.MINHT) ) 
RNZOO-0  •  *  *  *  * 

CONTINUE 

SFLG-0 

MAX-0 

START-0 

END-0 

CONTINUE 

THE  FOLLOWING  FINDS  THE  EDGES  OF  THE  HOLES,  U’S,  OR 
ARCHES  THAT  SURVIVE  THE  CONDITIONS  SET  BY  "DELOUA" 

DO  10  I- 1 , NCOLS 
STCL ( I ) -0 
EDCL ( I ) -0 
CONTINUE 


FLG-0 

DO  20  1-1, NCOLS 

IF  (  .  NOT.  ( ( FLG . EQ . 0 ) . AND . ( RNZ( I ) . GT . 0 )  )  )  GOTO  25 


STCL ( I ) * WT 
FLG«  1 

CONTINUE 

IF  ( .NOT. ( (RNZ( I  )  .EQ.  0 ) . AND . ( FLG . EQ . 1 ) ) ) 

GOTO  35 

EDCL ( I -  1 )-WT 

FLG-0 

CONTINUE 

CONTINUE 


RETURN 


A 
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C  SUBROUTINE  NAME:  DETECT  A-REGIONS 
C  WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

C  PURPOSE:  THIS  ROUTINE  SCANS  FROM  BELOW  THE  LINE  OF  PRINT 
C  LOOKS  FOR  A  CONCENTRATION  OF  DARK  PIXELS.  IF  THE  PIXELS 
C  ARE  ABOVE  THE  LINE  OF  PRINT  (BS)  BY  SOME  THRESHOLD  THEN 
C  THIS  REGION  IS  DEFINED  AS  A  A-REGION. 

c************************************************************ 

SUBROUTINE  DTA ( IBUFI , BGA , EDA , NROWS ,  NCOLS , TS , BS , I AR ) 


I BUF I ( 300 ) 

NROWS .NCOLS , START .END 
I ARR Y 1(128),  IARRY2( 128) 

STFLG ( 128) , ST  Z  D ( 128)  ,EDZD( 128) 
STCTU ( 128) ,STZU(128) , ED  ZU ( 128) 
STHL , ENHL .MAX ,MIN , S  F  LG 
TS  ,  BS 

BGA ( 128)  ,  EDA ( 128) 

WDTH , N , K . STOD( 128) ,STOU( 128) .MID 
IAR( 20 , 3) 


INITIALIZE  PARAMETERS 
MARKS  HORIZ.  LINE  FOUND  DOWN  SCAN 
STARTING  Z  ERO ( WH I TE  PIXEL)  DOWN 
ENDING  ZERO  DOWN 


INTEGER  I BUF I ( 3  < 
INTEGER  NROWS, N< 
INTEGER  IARRY1C 
INTEGER  STFLG ( 1 : 
INTEGER  STCTU(i: 
INTEGER  STHL.ENI 
INTEGER  TS,  BS 
INTEGER  BGA (128; 
INTEGER  WDTH, N, l 
INTEGER  IAR(20,: 

DO  10  I-l .NCOLS 
STFLG ( I )-0 
STZD< I )-0 
EDZD< I )-0 
STOD< I )-0 
CONTINUE 


DO  15  K-l, NCOLS 

IF  ( BGA ( K ) . EQ . 3 )  START-K 
IF  ( EDA (K).NE.3)  GOTO  15 
END-K 


DO  35  1-1, NCOLS 
STFLG ( I ) -0 
STZU ( I )-0 

EDZU ( I ) *0 
STOU ( I  )-0 
CONTINUE 


INIT.  PARAMETERS  FOR  UPWARD  SCAN 


IROW-1 , NROWS- 1  ; SCANS  FROM  BELOW 

CALL  GROW< IBUFI , N ROWS - I ROW+ 1  ,  1  .NCOLS ,  IARRY1  ) 

CALL  GROW (IBUFI  , NROWS- I ROW  , 1  , NCOLS  ,  I AR R Y 2  ) 

DO  50  ICOL-START , END 

IF  ( EDZU< ICOL) .NE . 0)  GOTO  50 

NBZ-0 

IF  < ( I ARR Y 1 ( ICOL- 1 ) . EQ.O) .OR . ( I ARRY 1 ( ICOL+1 ) 

. EQ . 0 ) )  NBZ-1 

IF  ( ( I ARRY l(ICOL).EQ.O). AND. ( ( I ARR Y 2 ( I  COL ) 

.  EQ  .  0)  .  OR . ( NBZ . EQ .  I ) ) )  STFLG ( ICOL)- 1 
IF  (  .  NOT.  ((IARRYKI  COL).  EQ.O).  AND. 

( ( IARRY2 (  ICOL  )  . EQ . 0)  . OR .  (NBZ . EQ. 1 ) ) ) )  GOTO  42 
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n  o 


IF  (STOU( ICOL) . EQ. 0)  STOU  (  I  COL  ) -NROWS  -  I  ROW-*- 1 
STFLG ( I COL ) *  1 

42  CONTINUE 

IF  ( < IARRY2 ( ICOL) . EQ . 1 5 ) . AND. 

/  ( IARRY1 ( ICOL) . EQ .  1 5 ) . AND. ( STFLG( ICOL) . EQ .  1  )  )  GOTO  45 

GOTO  47 

45  CONTINUE 

STZU ( I COL) -NROWS -I ROW 
STFLG  < IC0L)«0 
C  GOTO  50 

47  CONTINUE 

IF  (  ( IARRY1 (ICOL).EQ.O) .AND.  ( ( I ARR Y 2 ( I COL ) . EQ. 0) .OR  . 
/  (NBZ.EQ.  1 )) .AND .( STZU( ICOL)  .NE.O)) 

/  EDZU(  ICOD-NROWS-IROW+2 

50  CONTINUE 

40  CONTINUE 

C 

WDTH- END- START-*- 1 

C  TYPE  "START-" , START , "  END-", END 

MAX-NROWS 
MIN-  I 

MI D- 1 NT ( (BS+TS) /2 ) 

DO  70  I -START , END 

C  TYPE  "EDZU" , EDZU( I  )  , "  EDZD" , EDZD ( I ) 

IF  ( ( STOU( I ) . LT .MAX) . AND . ( STOU( I ) . NE. 0) )  MAX-STOU(I) 
IF  (STOU(I).GT.MIN)  MIN-STOU(I) 

70  CONTINUE 

C  IF  ( .NOT. ( (MAX. LE. MID) . AND. (MIN.GT. (BS+1 ) ) ) )  GOTO  75 

IF  <  . NOT . ( (MAX . LE . (TS-2 )  )  )  )  GOTO  75 
BGA(START)-6  ;  INDICATE  ABOVE  TS 

EDA(END)-6 
75  CONTINUE 

C  IF  ( .NOT. ( (MAX.LT.  MID  ) . AND .  ( M I N . LT . ( BS+ 1 )  )  )  ) 

/  GOTO  80 

IF  ( .NOT. ( (MAX. GT. (TS-2)  ))  )  GOTO  80 
BGA( START)-9  ;  INDICATE  TS-BS 

EDA( END ) -9 
80  CONTINUE 

15  CONTINUE 

C 

DO  110  J-l  ,  20 
DO  120  K-  1,3 
IAR(J,K)-0 
120  CONTINUE 

110  CONTINUE 


N-  l 

DO  100  K- 1 , NCOLS 
IF  (BGA(K) .GT.O)  START-K 
IF  ( ED A ( K ) . EQ . 0 )  GOTO  100 
END-K 

I AR ( N  ,  1 ) -START 
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C  SUBROUTINE  NAME:  DETECT  U  REGIONS 
C  WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

C  PURPOSE:  THIS  ROUTINE  SCANS  DOWN  FROM  ABOVE  THE  WORD  AND 
C  LOOKS  FOR  A  CONCENTRATION  OF  DARK  PIXELS.  IF  THE  PIXELS 
C  ARE  BELOW  TS  FOR  SOME  THRESHOLD  THEN  THIS  REGION  IS 
C  DEFINED  AS  A  U-REGION. 
c******************************** 


SUBROUTINE  DTU( IBUFI , BGU , EDU , NROWS .NCOLS ,TS ,BS , I  UR) 

INTEGER  I BUFI ( 300 ) 

INTEGER  NROWS , NCOLS , START , END 
INTEGER  IARRY1 ( 128) t  IARRY2(128) 

INTEGER  STFLG( 128) ,STZD( 128) ,EDZD( 128) 

INTEGER  STCTU (  128),STZU(128),EDZU(128) 

INTEGER  STHL , ENHL , MAX , MIN , SFLG 

INTEGER  TS,  BS 

INTEGER  BGU( l 28 )  ,  EDU(  1  2ft ) 

INTEGER  WDTH ,N,K,STOD(128),STOU(128),MID 


INTEGER  IUR(70,3) 


DO  10  1-1 .NCOLS 
STF  LG ( I ' -0 
STZD  (  I  )  -0 
EDZD( I )-0 
STOD ( I )  -0 
CONTINUE 


INITIALIZE  PARAMETERS 
MARKS  HORIZ.  LINE  FOUND  DOWN  SCAN 
STARTING  Z  ERO ( WH I TE  PIXEL)  DOWN 
ENDING  ZERO  DOWN 


;  SCANS  DOWN 
IROW  , 1 , NCOLS , I ARRY 1  ) 

IROW+1 , 1 , NCOLS , IARRY2  ) 


DO  15  K-l,  NCOLS  -.FINDS  START  AND  END 

IF  (BGU(K) .EQ. 3)  START-K  ;  IN  A  REGIONS 
IF  (EDU(K) .NE.3)  GOTO  15 
END-K 

DO  20  IROW* 1  . NROWS- 1  ;  SCANS  DOWN 

CALL  GROW (  IBUFI ,  IROW  ,  1  ,  NCOLS  ,  I ARRY  1  ) 

CALL  GROW ( I BUFI ,  I ROW+ 1 , 1 , NCO LS , I ARR Y 2  ) 

DO  30  ICOL-START , END 

IF  (EDZD(ICOL) .NE.O)  GOTO  30 

NBZ-0 

IF  ( ( I ARRY 1 < ICOL-1 ) .EQ. 0) .OR . ( I ARRY 1  (  ICOL+1 ) 

. EQ . 0 ) )  NBZ-1  ;  NEIGHBORING  PIXELS  ARE  DARK 
IF  < ( IARRY1 ( ICOL) . EQ.O) .AND. ( (IARRY2( ICOL) 

. EQ.O).OR.(NBZ.EQ.I)))  S TF LG ( I  CO L ) -  1 
IF  (.NOT.((IARRYl(ICOL). EQ.O). AND. 

( (  I  ARRY2 (  ICOL)  . EQ . 0 ) .OR . (NBZ . EQ .  I  )  )  )  )  GOTO  22 
IF  ( STOD ( ICOL  ).  EQ . 0 )  STOD ( I  COL )« I  ROW 
STFLG ( ICOL)- l 

CONTINUE 

IF  ( (  I  ARR Y2 ( ICOL)  . EQ .  1 5 )  . AND . 

(  I  ARRY 1 ( ICOL  )  . EQ .  I  5 )  . AND . ( STFLG(  ICOL) . EQ .  1 ) )  GOTO  25 

GOTO  27 
CONTINUE 
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STZD( ICOL)-IROW  +1 
STFLG( ICOL)-0 
GOTO  30 
CONTINUE 

IF  ( ( I ARRY 1 ( ICOL) .EQ.O) .AND. ( ( I ARR Y 2 ( I COL ) . EQ.O) .OR. 
(NBZ.EQ.  1 )). AND .( STZD( ICOL)  .NE.O)) 

EDZD( IC0L)-IR0W-1  ; MARKS  END  OF  ZERO  REGION 

CONTINUE 

CONTINUE 

WDTH-END-START+l 

TYPE  "START-" , START . "  END-", END 

MAX-NROWS 

MIN-  1 

MI D- I NT ( ( BS+TS ) / 2 ) 

DO  70  I-START  ,  END 

TYPE  "EDZU" , EDZU( I )  ,  "  EDZD" , EDZD< I ) 

IF  ( (STOD( I ) . LT. MAX) . AND . ( STOD( I ) .NE . 0) )  MAX-STOD( I ) 
IF  (STOD( I) .GT.MIN)  MIN-STOD(I) 

CONTINUE 

IF  ( .NOT. ( (MAX . LE. MID) .AND . (MIN .GT. (BS+1 ))) )  GOTO  75 

IF  ( .NOT. ( (MIN. GT. (BS+1 ))) )  GOTO  75 

BGU(START)»6 

EDU ( END ) -6 

CONTINUE 

IF  ( .NOT. ( (MAX. LT. MID) .AND. (MIN. LT .( BS+1 ))) )  GOTO  80 

IF  ( .NOT. ( (MIN. LT. (BS+1  )))  )  GOTO  80 

BGU(START)-9 

EDU(END)-9 

CONTINUE 

CONTINUE 

DO  110  J-l , 20 
DO  120  K- 1  ,  3 
I  UR ( J , K ) - 0 
CONTINUE 
CONTINUE 

N-  1 

DO  100  K-l.NCOLS 
IF  (BGU(K) .GT.O)  START-K 
IF  (EDU(K) .EQ.O)  GOTO  100 
END-K 

I  UR ( N , 1 ) -START 
I  UR (N , 2) -END 
I  UR ( N , 3)-BGU(START) 

N-N+l 

CONTINUE 


TYPE  " DTU  DONE" 
RETURN 


c************************************************************ 

C  SUBROUTINE  NAME:  DETECT  "E"  AND  M G " ’ S  <  ALSO  »,z,  AND  a  ) 

C  WRITTEN  BY:  I LT .  DAVID  V.  SOBOTA 

C  PURPOSE:  THIS  ROUTINE  DETECTS  THE  PRESENCE  OF  LETTERS  WHICH 
C  HAVE  AT  LEAST  THREE  HORIZONTAL  LINE  IN  THEM  BY  LOOKING  AT 
C  LETTERS  THAT  FIRST  MEET  THE  CONDITIONS  FOR  AN  "O"  SET  IN 
C  DTOUAE.FR.  IT  THEN  SCANS  FROM  ABOVE  AND  BELOW  LOOKING  FOR 
C  THE  FIRST  AND  SECOND  HORIZONTAL  LINE.  IF  THE  BEGINNING  OF 
C  THE  SECOND  HORIZONTAL  LINE  ON  THE  ABOVE  AND  BELOW  SCAN 
C  DON'T  OVERLAP  THEN  A  e/g/s  IS  DETECTED. 
c************************************************************ 


SUBROUTINE  DTEG (  IBUFI , BGO , EDO , NROWS , NCOLS , TS , BS , BGE , 
/  EDE.IOR.IER) 

INTEGER  I BUF I ( 300 ) 

INTEGER  NROWS, NCOLS, START, END 
INTEGER  IARRY 1(128),  IARRY2(128> 

INTEGER  STFLG ( 128),STZD(128),EDZD(128) 

INTEGER  STCTU ( 128) ,STZU< 128),EDZU(128) 

INTEGER  STHL , ENHL , SFLG 

INTEGER  TS,  BS 

INTEGER  BGO( 1 28) , EDO( 1 28) 

INTEGER  BGE (128) 

INTEGER  EDE( 128) .WDTK.N.K 
INTEGER  MIN, MAX, MID 
INTEGER  IER(20,3)  ,  IOR(20,3) 


DO  10  1-1, NCOLS 


INITIALIZE  PARAMETERS 


10 

C 


MARKS  HORIZ.  LINE  FOUND  IN  DOWN  SCAN 
STARTING  ZERO ( WH I TE  PIXEL)  DOWN 
ENDING  ZERO  DOWN 
MARKf  END  OF  E  REGION 
MARKS  START  OF  E  REGION 


STFLG ( I ) -0 
STZD ( I ) “0 
EDZD ( I  )-0 
EDE( I )-0 
BGE ( I )-0 
CONTINUE 

DO  15  K-l, NCOLS  FINDS  START  AND  END  O-REGIONS 
IF  ( BGO(K) . EQ . 3 )  START-K 
IF  ( EDO( K ) . NE . 3 )  GOTO  15 
END-K 

DO  20  IROW-1 .NROWS-l 

CALL  GROW (IBUFI,  IROW 

CALL  GROW( I BUF I ,  IROW+1 

DO  30  ICOL-START, END 
IF  (EDZD( ICOL) . NE . 0 )  GOTO  30 
NBZ-0 

IF  ( ( IARRY 1 ( ICOL- 1 ) . EQ . 0 ) .OR. (IARRY 1 < ICOL+1 ) 
.  EQ  .  0  )  )  NBZ-1  ;  NEIGHBORING  PIXELS  ARE  DARK 
IF  ((IARRYl(ICOL).EQ.O). AND. (<IARRY2( ICOL) 
.EQ.O).OR. (NBZ.EQ. 1)))  STFLG ( ICOL) ■ 1 
IF  ( (  IARRY2 ( ICOL) . EQ .  1 5  )  . AND . 

( IARRY 1 (  ICOL) . EQ .  1 5 ). AND .< STFLG( ICOL) . EQ .  1 ) )  GOTO  25 


;  SCANS  DOWN 
1  .NCOLS , IARRY 1  ) 

1  .NCOLS , IARRY2  ) 


GOTO  27 
CONTINUE 

ST2D( ICOL)-IROW  ;*•** 

STFLG( ICOL)«0 
CONTINUE 

IF  ( ( IARRY1 < I COL) . EQ . 0 ) .AND. ( ( I ARR Y 2 ( I COL ) . EQ . 0 ) .OR. 
(NBZ.EQ.  1 ) ) . AND. ( STZD< ICOL)  .NE.O)) 

EDZD ( ICOL)* I ROW- 1  ; MARKS  END  OF  ZERO  REGION 

CONTINUE 

CONTINUE 

DO  35  1*1 , NCOLS  ;  INIT.  PARAMETERS  FOR  UPWARD  SCAN 

STFLG ( I ) “0 

STZU( I )-0 

EDZUC I )-0 

CONTINUE 

DO  40  IROW-1 .NROWS-l  ;SCANS  FROM  BELOW 

CALL  GR0W( IBUFI , NROWS - I ROW+ 1 , 1 .NCOLS , IARRYI  ) 

CALL  GROW( IBUFI ,  NROWS- I ROW  , 1  , NCOLS , I ARR Y 2  > 

DO  50  ICOL-START , END 

IF  ( EDZU( ICOL) . NE. 0)  GOTO  50 

NBZ-0 

IF  ( ( IARRYI ( ICOL- 1 ) . EQ. 0) .OR. ( IARRYI ( ICOL+1 ) 

. EQ . 0  )  )  NBZ-1 

IF  ( ( IARRYI ( ICOL) . EQ . 0 ) . AND . ( ( IARRY2( ICOL) 

. EQ.O) .OR. (NBZ.EQ. 1  )  )  )  STFLG (  I  COL  )  -  1 
IF  ( ( IARRY2 ( ICOL) .EQ. 15) .AND. 

( IARRY 1 (  ICOL) . EQ.  1 5) . AND. (STFLG( ICOL) . EQ .  1 ) )  GOTO  45 

GOTO  47 

CONTINUE 

STZU( ICOL)-NROWS-IROW+l  ;  **** 

STFLG ( ICOL)-0 
CONTINUE 

IF  ( (IARRY 1(1  COL). EQ.O). AND . ( ( IARRY 2 ( ICOL) . EQ . 0 ) . OR . 
(NBZ.EQ.  1 )). AND .( STZU( ICOL)  .NE.O)) 

EDZU( IC0L)-NR0VS-IR0W+2 

CONTINUE 

CONTINUE 

N-0  ; INIT.  COUNT  EDZU  GT.  EDZD  (INDICATES  E-REGIONS) 

M-0  ; INIT.  COUNT  2ND  O-RECION  BELOW  BS  (OCCURS  IN  g) 

WDTH-END-START+l 

MAX-NROWS 

MIN-  1 

MID-INT((BS+TS)/2) 

TYPE  "START-” , START, "  END-", END 
DO  70  I - START , END 

TYPE  "EDZU" , EDZU( I ) , "  EDZD" , EDZD ( I ) 

IF  ( STZU( I ) .GE . ( BS+2 ) )  M-M+l 

IF  ( EDZU( I ) .GT. EDZD( I )  )  N-N+l  ;  COUNTS  »  OF  TIMES 

;  AT  LEAST  2  O-REGION 
;  DETECTED 


IF  ( .NOT. (START. EQ . END)  >  GOTO  69 

TYPE  ”M-",M,"  N-",N,"  BS-",BS,"  TS»",TS 

TYPE  "EDZU" , E D Z U (  I)  EDZD",EDZD(I) 

TYPE  ” STZU" , STZU( I  )  ,  "  STZD" , STZD ( I  ) 

CONTINUE 

IF  ( (STZD( I > . LT.MAX) . AND. ( STZD( I ) .NE. 0)  >  MAX-STZD(I) 
IF  ( STZU< I ) .GT .MIN )  MIN-STZU(I) 

CONTINUE 

IF  ( .NOT. ( (MAX. LT. (TS-2)).AND.(MIN.LT.(MID-1)))) 

/  GOTO  60 

BGO(START)-l 
EDO ( END )  »  1 
CONTINUE 

IF  ( .NOT. ( (MAX. LT . ( TS- 3 ) ) . AND . ( MIN . GT . ( M I D  +  3 ) ) ) ) 

/  GOTO  65 

BGO( START)-3 
EDO ( END )  ■  3 
CONTINUE 

IF  ( .NOT. ( (MAX. LT. MID) . AND . (MIN .GT. ( BS  +  3 ) ) )  )  GOTO  75 

BGO(START)»6 

EDO ( END ) ■ 6 

CONTINUE 

IF  ( . NOT. ( (MAX. LT. ( MI D~  2 ) ) .AND. (MIN.GT. ( MI D+2 ) ) ) ) 

/  GOTO  BO 

BGO(START)-9 
EDO ( END ) ■9 
CONTINUE 

IF  3  HORIZONTAL  LINES  DETECTED  FOR  A  THIRD  OF  THE  LETTER 
WIDTH  THEN  A  E-REGION  IS  DETECTED. 

IF  ((N.LT.(WDTH/3)).OR.(N.LT.3))  GOTO  76 

EDE( END ) -  1  2 

BGE(START)«12 

BGO ( START ) “0 

EDO ( END ) “0 

CONTINUE 

IF  ( (M. LT. ( WDTH/3 ) ) . OR . ( M . LT . 3 ) . OR . ( BGE( START) . EQ.O) ) 
/  GOTO  95 

BGE(START)«1 5 
EDE ( END ) ■ l 5 
BG0(START)-0 
EDO ( END  )  *0 
CONTINUE 
CONTINUE 

DO  110  J-  1  ,  20 
DO  120  K«  1  ,  3 
I OR ( J , K ) »0 
IER( J,K)»0 
CONTINUE 
CONTINUE 


N-  1 

DO  100  K* 1 , NCOLS 


IF  ( BGO  ( K ) . GT . 0 )  START-K 
IF  ( EDO( K ) . EQ . 0  >  GOTO  100 
END-K 

IOR (  N  , 1 ) "START 

I OR ( N  ,  2  ) “END 

IOR (N, 3) -BGO (START) 

N-N+l 
CONTINUE 
N-  L 

DO  130  K-l , NCOLS 
IF  (BGE(K) .GT.O)  START-K 
IF  (EDE(K) .EQ.O)  GOTO  130 
END-K 

IER(N, 1 ) -START 

IER(N,2)-END 

I ER ( N , 3 ) -BGE (START ) 

N-N+l 

CONTINUE 

TYPE  " DTEG  DONE" 

RETURN 

END 


SUBROUTINE  NAME:  FIND  LOCATION  OF  "  I "  '  S  (IFIND.FR) 

WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  BECAUSE  THE  LETTER  "I"  IS  ONE  OF  THE  FEW  LETTERS 
WHERE  THE  VCL  DOES  NOT  OCCUR  AT  THE  EDGE  OF  THE  LETTER, 
THIS  SPECIAL  LETTER  DETECTOR  HAS  TO  BE  USED  TO  ACCOUNT  FOR 
THIS  EXCEPTION.  THIS  ROUTINE  WILL  ALSO  FIND  A  SMALL  CASE  J 


OVERLAY  IF 

SUBROUTINE  I F I ND ( NCOLS , BG VC , ED VC , VCLD , VCLU , TS , BS , I FND 
/  BG I , ED  I , IBD) 

INTEGER  VCLD (256),  VCLU<256) 

INTEGER  NCOLS,  START,  END,  SVCLD,  SVCLU 
INTEGER  I FND(  2  5  6  ),  BG I (  2  5  6  )  ,  EDI(  2  56  ) 

INTEGER  TS,BS,MAX, ITHR  ;******* 

INTEGER  BG VC ( 128),EDVC(128),IBD(20,2),N 

START  -USED  TO  INDICATE  THE  START  OF  A  VCL 
END  -USED  TO  INDICATE  THE  END  OF  A  VCL 
SVCLD-0  ;  SUM  OF  PIXELS  IN  VCL  DOWN 
SVCLU-0  ;  SUM  OF  PIXELS  IN  VCL  UP 


DO  5  1-1, 
I FN  D ( I  )-0 
BGI ( I )-0 
EDI< I )-0 
CONTINUE 


NCOLS  ; 
;  MARKS 
;  MARKS 
:  MARKS 


INITIALIZED  PARAMETERS 
BEGINNING  AND  END  OF  I  FOR  DISPLAY 
BEGINNING  OF  I 
END  OF  r 


DO  7  J-1,20 
DO  8  K-  1  ,  2 
IBD( J ,K)-0 
CONTINUE 
CONTINUE 

THIS  LOOP  FINDS  THE  EDGES  OF  EACH  VCL  AND  THEN  SUMS  THE 
VCL'S  DOWN  AND  VCL'S  UP  AND  COMPARES  THE  TWO.  IF  SUM  UP 
IS  GREATER  THAN  OR  EQUAL  TO  TWICE  SUM  DOWN  THAN  AN  LOWER 
CASE  "I"  IS  DETECTED. 

N-  1 

DO  10  I  •  1  , NCOLS 

IF  ( BGVC( I ) . EQ. 10)  START* I 
IF  ( EDVC( I ) . NE . 10)  GOTO  10 
END*  I 

MAX-0  ;******** 

DO  30  J -START , END 

IF  ( VCLD( J) . LT. 2 )  GOTO  25  ;  DOT  ON  I  HAS  TO  BE 

SVCLD-SVCLD  +  VCLD< J  )  ;  TWO  PIXELS 

CONTINUE 

S VCLU- SVCLU+ VCLU ( J ) 


IF  (VCLU(J).GT.MAX)  MAX-VCLU(J) 

CONTINUE 

TYPE  "SUMD" , SVCLD ,  "  SUMU",SVCLU 

ITHR-(BS-TS)/2+l  ;  STRAIGHT  LINE  THRESHOLD  FOR  I 
IF  ( ( INT< SVCLU/SVCLD) ) . LT. 2 )  GOTO  35  ;*** 

IF  ( . NOT. ( ( ( INT( SVCLU/SVCLD) ) . GE . 2 ) .AND. (MAX . GT . I THR ) 
. AND . (MAX . LT. ( BS-TS+2 ) ) > >  GOTO  35  ;*** 

IFND(START)-10 
I FND ( END )  -  1  0 
BGI(START>-10 
EDI ( END ) *10 
IBD(N, 1 ) - ST ART 
IBD(N , 2 ) -END 

N-N+l 

CONTINUE 

SVCLU-0 

SVCLD-0 

CONTINUE 


RETURN 


uuuuuuuuu 


SUBROUTINE  NAME:  FIND  LOCATION  LETTER  LOWER  CASE  MTM 
WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  BECAUSE  THIS  LETTER  IS  ONE  OF  THE  FEW  LETTERS 
WHERE  A  VCL  DOES  NOT  OCCUR  AT  THE  EDGE  OF  THE  LETTER, 
THIS  SPECIAL  ROUTINE  IS  USE  TO  DETECT  THIS  LETTER  AND 
HANDLE  THIS  EXCEPTION. 


C 


C 


5 


10 

C 

C 

C 


30 


OVERLAY  TF 

SUBROUTINE  FINDT( IBUFIN, NCOLS , NROWS , EVC ,TS , BS ,MXHT1 
/  MXHT2 , MNHT 1 , MNHT2 , BGT , EDT ,TBD) 


INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 

INTEGER 


IBUFIN<  300) 

E  VC (  128) 

LNGTH,  TS,  BS,  END(128),  START(128),  NCOLS 
SCROSS (128),  ECROSS< 128) ,TFLG 
I ARR Y  1(12  8),  IARRY2C128) 

TBEG ,  TEND,  NROWS 
SCOL , ECOL 
MAX 

ENDTH,  LTHR 

T,  M,  MTOTAL,  STTH 

BGT( 1 28 ) , EDT< 1 28 ) ,  MDTT ,  THR 

MXHT1 ( 1 28) .MNHTl ( l 28) ,  K,  DIF 

MXHT2 ( 1 28) .MNHT2 ( 1 28) ,  KST.KED,  MN ,  MX 

TSRCH , BSRCH 

TBD(20,2) , NUMT 


N-l  ;  COUNTS  NUMBER  OF  T'S  FOUND 

DO  5  1-1,  NCOLS  ;  INITIALIZE  T  PARAMETERS 

SCROSS ( I ) "0  ; MARKS  START  OF  HORIZONTAL  LINE  IN 

ECROSS ( I ) "0  ; MARKS  END  OF  HORIZONTAL  LINE  IN  T 

CONTINUE 


DO  10  1-1, NCOLS 
BGT  < I )-0 
E  DT ( I ) -0 
START( I  )-0 
END ( I )-0 
CONTINUE 


,  MARKS 
,  MARKS 
;  MARKS 
;  MARKS 


OF  T 


BEGINNING 
END  OF  T 

START  OF  EACH  VCL 
END  OF  EACH  VCL 


THIS  LOOP  PUTS  THE  START  &  END  OF  EACH  VCL  IN  AN  ARRAY 


M-l  ;  COUNTS  NUMBER  OF  VCL ' S  FOUND  IN  EVC  ARRAY 

DO  20  I - l , NCOLS 

IF  (EVC( I ) .NE  .  10)  GOTO  20 
IF  ( START(M) . NE . 0  )  GOTO  30 
START ( M ) ■ I 
GOTO  20 

CONTINUE 

END ( M ) ■ I 

M-M+l  ;  INCREMENT  COUNT 


CONTINUE 


o  n  o  o  n  n 


c 

C  THIS  LOOP  LOOKS  AT  EACH  VCL  AND  SEES  IF  IF  HAS  A 

C  HORIZONTAL  LINE  THAT  EXTENDS  BEYOND  THE  VCL  AROUND  THE 

C  "TS"  ROW. 

C 

MTOTAL-M-1  ;  TOTAL  NUMBER  OF  VCL'S 

DO  15  T- I , MTOTAL 

TYPE  ”  START",  START ( T) ,  "  END",  END ( T ) 

TYPE  " MX  1 " ,  MXHT1 (START(T) )  ,"  MX 2" ,  MXHT2 ( END ( T ) ) , 

/  "  MN1",  MNHT1 (START(T) ) ,"  MN 2 " , MNHT2 ( END ( T ) ) 

C 

C  MXHT 1 , MXHT2  CONTAIN  THE  MAX.  HEIGHT  OF  CONTINUOUS  PIXELS 
C  IN  THE  FIRST  HALF  OF  THE  VCL  AND  THE  SECOND  HALF  OF  THE 
VCL.  THESE  PARAMETERS  ARE  PASSED  FROM  HTVCL.FR 


THR-BS-TS 

KST-START(T) 

KED-END(T) 

MX-  MXHTl(KST) 

IF  ( MXHT 2 (KED) .LT.MX)  MX -MXHT 2 ( KED ) 

MN-  MNHTl(KST) 

IF  (  MNHT2 (KED) .GT. MN )  MN-MNHT2 ( KED ) 

DIF-MN-MX 

TYPE  "(BS-TS)",  THR  ,  "  (MX-MN)",  DIF 

IF  (DIF. LE. THR)  GOTO  15  ;  CHECK  IF  HEIGHT  OF  VCL  BIG 

;  ENOUGH  FOR  T 


TSRCH-TS-5 

IF  (TSRCH.LT.l)  TSRCH-1  ;  THIS  PREVENTS  SEARCH  FROM 

;  EXCEEDING  BUFFER  LIMITS 


BSRCH-TS+  S 

DO  AO  J-TSRCH , BSRCH  ;  SEARCH  WINDOW  HORIZ. 
LNGTH-0  ;  COUNTS  LENGTH  OF  HORIZ. 

TFLG-0  ;  INDICATES  WHEN  COND.  FOR  T  IS  MET 
CALL  GROW ( IBUFIN,J,  1  , NCOLS  ,  IARRY 1 ) 

CALL  GROW ( I BUFIN , J+l , 1 , NCOLS , I ARR Y2 ) 
TBEG-START(T)-10  ;  SETS  UP  SEARCH  WINDOW 

TEND-END(T)+10  ;  FOR  HORIZ.  LINE 

IF  (TBEG.LE.O)  TBEG-l 
IF  (TEND. GT. NCOLS)  TEND-NCOLS 
TYPE  "00",  "TBEG"  ,  TBEG ,  "  TEND",  TEND 


THIS  LOOP  COUNTS  LENGTH  OF  HORIZONTAL  LINE  UNTIL  A 
VERTICAL  CONTINUOUS  GAP  OF  TWO  PIXELS  OCCUR.  IT  THEN 
CHECKS  IF  THE  LINE  EXTENDS  BEYOND  THE  VCL  TO  FORM  THE 
C  T  SHAPE. 

C 

DO  50  K-  TBEG, TEND 

IF  ( ( I ARRY 1 ( K)  . EQ  .  1 5 ) . AND . 

/  ( I ARRY2 (K ) . EQ  .  1 5 ) )  GOTO  60 

LNGTH-LNGTH+l 

TYPE  "01",  "  J-",  J,  "  K-" ,  K,  "  L- " ,  LNGTH 

LTHR-2*(END(T)-START(T) ) 

STTH-START(T)-2 


98 


Iv/vlvlvl'.-l-  vLvlvlvIv-.-I*. 


.  --  v 


ENDTH«END(T)+2 

IF  (( (K-LNGTH) . LT. STTH)  .  AND  . 

/  (LNGTH. GT. LTHR  ) . AND . ( K . GT . ENDTH ) )  TFLG-1 

TYPE  ''#2","  TFLG"  ,  TFLG,"  J-",J,"  K-",K,  IARRYl(K), 

/  I ARR Y  2 ( K ) 

GOTO  55 

60  CONTINUE 

TYPE  "03","  TFLG" .TFLG, "  L"  ,  LNGTH , "  K-",K, 

/  IARRYl (K) , IARRY2(K) 

IF  (TFLG.NE.l)  GOTO  65 
ECROSS ( N ) -K-  l 
SCROSS(N)-K-LNGTH 

C  THE  NEXT  TWO  STATEMENTS  MAKE  SURE  THE  DETECTED  T  DOES  NOT 
C  OVERLAP  THE  NEXT  VCL  REGION  WHEN  THE  END  OF  THE  CROSS  IN 
C  THE  T  IS  FOUND. 

IF  ( ( (K-LNGTH) . LT . END ( T- 1 ) ) . AND . ( ( T- 1 ) . NE . 0  )  ) 

/  SCROSS(N)-END(T-l ) 

IF  < ( <K- 1 ) .GT. START (T+l ) ) . AND. ( (T+l ) . LE. MTOTAL) ) 

/  ECR0SS(N)-START(T+1  ) 

TYPE  "04",  "  N-",  N,"  ST.-T",  SCROSS(N),  "  END-T", 

/  ECROSS ( N ) 

N-N+l 
GOTO  1  5 

65  CONTINUE 

LNGTH-0 

55  CONTINUE 

50  CONTINUE 

TYPE  "06",  "  N-",  N,"  ST.-T",  SCROSS(N),  "  END-T", 

/  ECROSS ( N ) 

TYPE  "07",  "  K-”,K,  "  TFLG",  TFLG 

IF  (. NOT .< (ECROSS(N ). EQ.O ). AND . (TFLG . EQ . 1 )) )  GOTO  40 
ECROSS ( N ) -TEND 
C 

C  THE  NEXT  THREE  STATEMENTS  MAKE  SURE  EDGES  OF  THE  T  DO  NOT 
C  EXTEND  INTO  INTO  NEXT  VCL  REGION  WHEN  END  OF  CROSS  IS  NOT 
C  FOUND. 

IF  ((T+l ) .LE. MTOTAL)  ECRO S S ( N ) - S TART ( T+ 1 ) 
SCROSS(N)-TEND- LNGTH 

IF  ( ( ( TEND- LNGTH )  . LT . END ( T- 1 ) ) . AND . ( ( T- 1 ) . NE . 0 )  ) 

/  SCR0SS(N)-END(T-1 ) 

N-N+l  ;  *** 

GOTO  l 5 

40  CONTINUE 

15  CONTINUE 

C 

DO  100  J- 1 , 20  ;***** 

DO  110  K-  1  ,  2 
TBD ( J , K ) - 0 
110  CONTINUE 

100  CONTINUE 

C 

DO  90  I- 1 ,N- 1  ; ***** 


99 


TYPE  ”08","  N»",I,"  ST.  OF  T" ,  SCROSS(I),  "  END  OF  T 
ECROSS (  I  ) 

THIS  CODES  HANDLES  SPECIAL  CASE  OF  TWO  T'S  IN  A  ROW. 
IF  (SCROSS( 1+1 ) . LE.O)  GOTO  85 

IF  (. NOT .( ECROSS < I ). GT . SCROSS ( 1+1 ))  )  GOTO  85  ;  T 

MDTT-(ECROSS( I )+SCROSS( I+l ) ) /2  ;  MARKS  MIDDLE  OF  TT 

ECROS  S ( I )-MDTT 
SCROSS ( 1+1 )-MDTT 

TYPE  "  //  9 "  ,  "  N-",  I,"  ST.-T",  SCROSS(I),  ”  END-T"  , 

ECROSS (  I  ) 

CONTINUE 
SCOL-SCROSS(  I  ) 

ECOL-ECROSS( I ) 

TYPE  "/MO","  SCOL",SCOL,"  ECOL",ECOL 
IF  ( (ECOL. LE . 0) . OR . ( SCOL. LE . 0) )  GOTO  90 
BGT ( SCOL ) “10  ;  MARKS  BEGINNING  OF  T 

EDT ( ECOL ) *  1 0  ;  MARKS  END  OF  T 

TBD< I , 1 )-SCOL  ; 

TBDC I , 2 )-ECOL  ; 

CONTINUE 


RETURN 


f  '!> 


K  <.'■  L1. 
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£************************************************************ 

C  SUBROUTINE  NAME:  DECISION  ALGORITHM  (DECALG.FR) 

WRITTEN  BY:  1 LT .  DAVID  V.  SOBOTA 

PURPOSE:  THIS  ROUTINE  LOOKS  AT  EACH  VCL  IN  A  WORD  AND  ITS 
NEIGHBORING  VCL'S  AND  USES  INFORMATION  ON  THE  HEIGHT  OF 
THE  VCL'S  AND  NEIGHBORING  LETTER  FEATURES ( 0 , U , E , ARCH , I , T ) 
TO  DECIDE  ON  WHERE  TO  SEGMENT  LETTERS  THAT  ARE  TOUCHING. 

IT  ALSO  USES  THE  PRESENCE  OF  I  AND  T  TO  EVALUATE  WHETHER 
A  VCL  IS  DUE  TO  TWO  LETTERS  NEXT  TO  EACH  OTHER.  THIS 
DECISION  ALGORITHM  IS  A  REWRITTEN  VERSION  OF  THE  ORIGINAL 
DECISION  ALGORITHM  DEMONSTRATED  IN  THE  THESIS.  THIS 
VERSION  INCORPORATES  THE  ENHANCED  DECISION  RULES  WHICH 
RELY  ON  DETECTING  VARIOUS  TYPES  OF  0,U,A,8  REGIONS  IN 
ORDER  TO  GET  A  BETTER  IDEA  AS  TO  WHAT  THE  LETTER 


COMBINATION  IS  SO  THAT  IT  CAN  BE  SEGMENTED  WITH  MORE 
CONFIDENCE.  IT  ATTEMPTS  TO  RECOGNIZE  X,V,W,  LETTER 
FEATURES  SO  THAT  FALSE  SEGMENTATION  WILL  NOT  OCCUR  DUE  TO 
LETTER  COMBINATIONS  THAT  HAVE  SIMILIAR  FEATURES.  IT  ALSO 
HAS  AN  U  DETECTOR  (LU1.2)  WHICH  CAN  BE  USED  TO  ELIMINATE 
FALSE  SEGMENTATION  CLUES  DUE  TO  U-REGIONS  IN  V  AND  "TAILS" 
OF  SOME  LETTERS.  DUE  TO  TIME  CONSTRAINTS,  THE  SEGMENTATION 
OF  LETTER  COMBINATIONS  WITH  NO  VCL  AT  THE  MIDDLE  HAVE  NOT 
BEEN  INCORPORATED  AND  FURTHER  REFINING  NEEDS  TO  BE  DONE. 

c 

OVERLAY  DA 

SUBROUTINE  DEC ALG ( NROWS , NCOLS , I VCL, IAR , I  UR , I  OR , I ER , 

/  XIAR,XIUR,XIOR,XIER,GAP,IBD,TBD, IBUFO) 


./.  ■ 


L. 


3 

C 

C 

c 


INTEGER 
INTEGER 
INTEGER 
LOGICAL 
UO , UA , UE 
LOGICAL  V  I T 
LOGICAL 
LOGICAL 
LOGICAL 
INTEGER 


NCOLS,  NROWS,  IBUFO(3O0),  IARRAY(128) 

S  EG (  128) , G A  P (  128) 

P2.N1 ,V1 ,V2,  MD VC 

AO,OA,EA,AE,OE,EO,00,AA,EE,UU,OU,AU,EU, 


SPP2.SPV1  ,  SPV2  ,  SPN1 
P2BL,VIBL,V2BL,N1BL 
P2AB.V1AB, V2AB.N1AB  ,  ABBL 
VOK  ,  AO  L  1  ,UOLl  ,AOL2 , UO  L  2 ,VAU,U9 , UVA 
IVCL( 2  0 , 10) , IAR( 2  0 , 3)  ,  I  UR (20 , 3  )  ,  IOR( 2  0 , 3  )  , 

I  ER ( 2  0 , 3) 

INTEGER  XIAR(20,3),XIUR(20,3),XIOR(20,3),XIER(20,3) 
INTEGER  MVCW , VCW , TWD , IWD,SUM,TBD(20,2),IBD(20,2),WDVC 
INTEGER  MAX ,V2A,V2U,V2E,V20,V1A,V1U,V1E,V10 
INTEGER  WD , PWD , NWD , RW2A , RW20 , RW1 A , RWIO , AU2 ,UA1 
INTEGER  WDO , WDO 1 , WD02 , WDA , LU 1 , LU2 


DO  5  I -  1  , NCOLS  ; 
SEG(I)-0  ; 

IF  ( G A P (  I  )  .GT.  5) 
CONTINUE 


INIT.  SEG.  ARRAY  WHICH  INDICATES 
WHERE  LETTERS  EDGES  ARE  LOCATED. 

S  EG ( I ) ■ l 0  ;  GAPS  ARE  DISPLAYED  ON 

:  SEGMENTATION  DISPLAY 


FIND  MINIMUM  WIDTH  OF  VCL  REGION 

BEST  IF  TAKEN  FROM  LARGE  SAMPLE  OF  WORDS  TO  INSURE  THAT 


L 


L 


u  o 


C  MVCW  IS  DUE  TO  VCL  IN  ONE  LETTER. 

MVCW-NCOLS 
DO  10  N-1,20 

IF  ( I VCL ( N , 1 ) . EQ . 0  )  GOTO  20 
VCW- I VCL ( N , 2 ) - I VCL ( N , l ) + 1 
IF  (VCW. LT. MVCW)  MVCW-VCW  ;  MVCW-  MIN.  VCL  WIDTH 
10  CONTINUE 

20  CONTINUE 

C 

C  *  *  *  *  *MA I N  LOOP  FOR  INCORPORATING  FEATURES  IN  IVCL***** 

C  FIND  AVG.  WIDTH  OF  VCL  IN  I'S  AND  T'S. 

TWD-NCOLS 

IWD-NCOLS 

C  ********  BEGINNING  OF  MAIN  LOOP  ********* 

DO  30  N-  1,20 

IF  ( IVCL(N, 1 ) . EQ. 0)  GOTO  55 

K-0 

SUM-0 

DO  35  J- 1 ,20 

IF  (TBD( J, 1 ) .EQ.O)  GOTO  40 

IF  ( . NOT. <(TBD(J,1).LT.IVCL(N,1)). AND .(TBD(J,2).GT. 

/  I VCL ( N  ,  2  )  )  )  )  GOTO  35 

S  UM- S  UM+ ( IVCL(N, 2 )-IVCL(N,  1)  +  1) 

K-K+l 

IVCL(N,7)-2  ;  2-INDICATES  VCL  IS  MID.  VCL  LETTER  T. 

35  CONTINUE 

40  CONTINUE 

IF  (SUM.NE.O)  TWD- ( SUM/K )  ;  AVG.  WIDTH  OF  VCL'S  IN  T'S 

C 

K-0 

SUM-0 

DO  45  J- 1,20 

IF  ( IBD( J , 1 ) . EQ. 0 )  GOTO  50 

IF  ( .NOT. ( IBD( J , 1 ) . EQ. I VCL( N , 1 ) ) )  GOTO  45 
SUM-SUM+( IBD(J,2)-IBD(J,  1  )  +  1  ) 

K-K+l 

I VCL ( N , 7 ) - 1  ;  1 -INDICATES  VCL  IS  SAME  VCL  IN  LETTER  I. 

45  CONTINUE 

50  CONTINUE 

IF  (SUM.NE.O)  I WD- ( SUM/ K )  ;  AVG.  WIDTH  OF  VCL'S  IN  I'S 

C 

MAX-0 

DO  75  I-IVCL(N,  1 )-3,IVCL(N,  1  ) 

IF  (  GAP ( I ) . GT . MAX  )  MAX  -GAP(I)  ;STORES  MAX  GAP  NEAR 
75  CONTINUE  ;FIRST  EDGE  OF  VCL 

I VCL(N , 5 ) -MAX  ;  STORES  GAP  IN  POSITION  5 
MAX-0 

DO  85  I-IVCL(N, 2 ) , IVCL(N, 2 )+3 

IF  (GAP( I ) .GT.MAX  )  MAX  -GAP(I)  jSTORES  MAX  GAP  NEAR 
85  CONTINUE  ;2ND  EDGE  OF  VCL 

I  VC  L ( N , 6)-MAX 

CHECKS  TO  SEE  IF  VCL  NEXT  TO  LETTER  T 


102 


3- INDICATES  BEG.  OF  VCL  NEXT  TO  T 

4- INDICATES  END  OF  VCL  NEXT  TO  T 


DO  90  1-1,20 

IF  ( I VCL ( N , 1 ) . EQ . 0  )  GOTO  100 

IF  ( ( IABS( I VCL(N, 1 )-TBD( I , 2) ) ) . LE . 3)  IVCL(N,7)-3 

IF  < ( IABS( IVCL(N , 2)-TBD( I  ,  1 ) ) ) . LE. 3)  IVCL(N,7)-4 

CONTINUE 
CONTINUE 


V2A-0 

INDICATES 

2ND 

EDGE 

VCL 

NEXT 

TO 

A-REGI ON 

V20-0 

INDICATES 

2ND 

EDGE 

VCL 

NEXT 

TO 

O-REC ION 

V2U-0 

INDICATES 

2ND 

EDGE 

VCL 

NEXT 

TO 

U-REGION 

V2E-0 

INDICATES 

2ND 

EDGE 

VCL 

NEXT 

TO 

E-REGION 

V1A-0 

INDICATES 

1  ST 

EDGE 

VCL 

NEXT 

TO 

A-REGION 

V10-0 

INDICATES 

1  ST 

EDGE 

VCL 

NEXT 

TO 

O-REG ION 

V1U-0 

INDICATES 

1  ST 

EDGE 

VCL 

NEXT 

TO 

U-REGION 

V1E-0 

INDICATES 

1ST 

EDGE 

VCL 

NEXT 

TO 

E-REGION 

RW2A-0 

INDICATES 

ROW 

IN  IAR.IOR,  ETC. 

RW20-0  ; 

RW1A-0  ; 

RW 1 0-0 

****  LOOP  70  CHECKS  WHAT  REGIONS  ARE  NEXT  TO  VCL  **** 
DO  70  1-1,20 

IF  ( IVCL(N , 1 ) . EQ . 0)  GOTO  110 
****  2ND  EDGE  OF  VCL  LOOKED  AT  **** 

IF  ( .NOT. ( ( IABS( IVCL(N,2)-IAR( I , 1 ) ) > .LE. 3) )  GOTO  71 
V2  A— I AR ( I , 3 )  ;  SPECIFIC  TVPE  OF  REGION  STORED 

RW2A-I  ;  ROW  POS .  STORED  FOR  LATER  V  DETECTION 

CONTINUE 

IF  < .NOT. ( ( IABS< IVCL(N , 2 ) - I  OR ( I , 1 ) ) ) . LE . 3) )  GOTO  72 
V20-I0R(  1,3) 

RW20-I  i  ROW  STORED  FOR  LATER  X  DETECTION 

CONTINUE 

IF  ( ( IABS< IVCL(N, 2 )-IUR< I , 1 > > ) . LE . 3)  V2U-IUR(I,3) 

IF  ( < IABS< I VCL<N  ,  2 ) -IER<  I  ,  1 ) ) ) . LE . 3 >  V2E-IER(I,3) 
****  1ST  EDGE  OF  VCL  LOOKED  AT  **** 

IF  <  .NOT. ( ( IABS< IVCL(N  ,  l )-IAR( I , 2) ) ) . LE . 3) )  GOTO  73 
V 1  A- 1 AR ( I ,3) 

RW1A-I 

CONTINUE 

IF  ( .NOT. ((IABS(IVCL(N, 1 ) - I OR ( I ,2)))  .  LE . 3  )  )  GOTO  74 
V 10-I0R( I ,3) 

RW10-I 

CONTINUE 

IF  < < IABS( IVCL(N, 1 )-IUR( I , 2 ) ) ) . LE. 3)  V1U-IUR(I,3> 

IF  ( ( IABS( IVCL(N , 1 )-IER( I , 2 ) ) ) . LE. 3)  VIE-IER(I,3) 

CONTINUE 

CONTINUE 


16  LOGICAL  VARIABLE  INDICATE  REGIONS  NEXT  TO  VCL 
AO-( ( VI A.GT . 0 ) . AND. < V20 . GT. 0) )  ;  1 

0A-( ( VIO.GT.O) . AND. ( V2A.GT.0) )  ;  2 


EA-< (V1E.GT.0) 
AE-((VlA.GT.O) 
OE-((V10.GT.O) 
EO-<(VIE.GT.O) 
OO-((VlO.GT.0> 
AA-( ( VlA.GT.O) 
E  E • ( (V1E.GT.O) 
UU-( (VIU.GT.O) 
OU-((V10.GT.O) 
AU-( (VlA.GT.O) 
EU-((V1E.GT.0) 
UO-( ( V1U.GT.O) 
UA-( (V1U.GT.O) 
UE-  ( (V1U.GT.O) 


.AND.  (V2A.GT.O) ) 
.AND. ( V  2  E . GT . 0 ) ) 
.  AND . ( V2E .GT . 0 ) ) 
.AND. (V20.GT.O) ) 
.AND. ( V20.CT.0) ) 
.AND. ( V2A.GT.O) ) 
.AND. (V2E.GT.O) ) 
.AND. ( V2U.GT.O) ) 
.AND. < V2U.CT.O) ) 
.AND. ( V2U.GT.O) ) 
.AND. ( V2U.GT.O)  ) 
.AND. ( V20.GT.0) ) 
.AND.  (V2A.GT.O) ) 
.  AND. ( V2E . GT. 0 ) ) 


3 

4 
3 
6 

7 

8 

9 

10 
1  1 
1  2 
l  3 
1  4 
1  5 
16 


CHECKS  TO  SEE  IE  FEATURES  MEET  THE  CONDITIONS  FOR  A  VCL 
IN  X. 

1ST  O-REG ION  IS  CHECK  TO  SEE  IT  MEETS  CORRECT  CONDITIONS 
IF  ( . NOT. (00. AND. ( V20. EQ. 9) . AND . ( V10. EQ . 9)  )  )  GOTO  123 
TYPE  N,"  X  O-REG I ON  CONDITION  CHECK" 

DO  120  I- IOR(RV10 , 1 )-3 , I0R(RW10 *  1 ) 

IF  ( GAP ( I ) . GE . 3 )  IVCL(N,7>-6 

DO  130  J-  1  ,  20 

IF  ((XIAR(J,2).EQ.I).OR.(XIUR(J.2).EQ.I).OR. 

/  (XIER( J  ,  2) . EQ.  I  )  )  I  VCL( N , 7 ) ■ 6 
CONTINUE 
CONTINUE 

IF  ( IVCL(N , 7 ) . NE . 6 )  GOTO  119 

TYPE  N,"  MEETS  1ST  O-REGION  X  CONDITION  " 

CONTINUE 

2ND  O-REGION  IS  CHECKED  AS  ABOVE. 

DO  129  I*IOR(RW10, 2) , I0R(RW10, 2  )  +  3 
IF  (GAP( I ) .GE. 5 )  IVCL(N,7)-6 
DO  139  J-1,20 

IF  ((IAR(J,2).EQ.I).OR.(IUR(J,2).EQ.I).OR. 

/  (  IER( J , 2  )  . EQ. I  )  )  IVCL(N,7)-6 

CONTINUE 
CONTINUE 

IF  ( IVCL(N , 7 ) .NE. 6)  GOTO  138 

TYPE  N,"  MEETS  2ND  O-REGION  X  CONDITION  *' 

CONTINUE 


CHECK  WIDTH  OF  2  O-REGIONS  TO  SEE  IF  TOO  NARROW  o , b , c , ETC . 
ALSO,  CHECK  WIDTH  OF  A-REGION  BETWEEN  O-REGIONS  TO  SEE 
IF  WIDER  THAN  O-REGIONS. 

MAX-0 

WD01 -I0R(RW10 , 2 )-I0R(RW10 , 1 >+l  ;  MAX  WIDTH  OF  O-REGION 

WD02-IOR(RW20,2)-IOR(RW20, 1)+1 

DO  140  1-1,20  ;  DETERMINE  MAX.  WIDTH  OF  O-REGION 

WDO-IOR(I ,2)-I0R(I , 1 )+l 

IF  ( (WDO.GT.MAX) .AND. ( IOR( I , 3) .EQ. 9) )  MAX-WDO 
IF  ( (XIAR( I , 2) .LE. I OR ( RW2  0 , 2 ) ) . AND . ( X I AR ( I , 1 ) . GE . 


1  04 


140 


123 

125 

C 

C 

C 

C 


/ 

/ 


■  .■v.'.'J'A.ii  Lvtv.\  n  a-  - .  . 


IOR ( RW 10, 1 ) ) ) 

WDA-XIAR(I ,2)-XIAR< I , 1 )+l  ;  WIDTH  OF  A-REGION 

CONTINUE  ;  BETWEEN  O-REGIONS 

IF  ( (WDOl .LT.MAX) .AND. (WD02 . LT.MAX) )  IVCL(N,7)-6 
IF  ( (WDA.GT.WDOl ) .OR. (WDA.GT.WD02) )  IVCL(N,7)-6 
IF  < IVCL(N , 7 ) .NE . 6)  GOTO  123 

TYPE  N,"  MEETS  WIDTH  O-REGION  X  CONDITION  " 

CONTINUE 

CONTINUE 


CHECKS  TO  SEE  IF  FEATURES  MEET  THE  CONDITIONS  FOR  A  VCL 
IN  V,W,OR  Y. 

5 - I ND I CATES  POSSIBLE  VCL  IN  V,W,OR  Y. 


IF  (AU.OR.UA)  GOTO  111 

VOK-( < ( IVCL(N , 3 ) . EQ . 1 ) . AND . ( IVCL<N  ,  4) . EQ .  1 ) ) . OR . 

/  C < IVCLCN , 3) . EQ. 3) . AND. ( IVCL(N, 4) . EQ . 3) ) ) ;  VCL  IS  OK 
DO  105  1-1,20 

A0L1 •<  < I AR ( I , 2) . GE . I VCL ( N , 1 ) ) .AND. ( I AR ( I , 1 ) . LT . 

/  IVCL(N.l)));  A-VL 

U0L2«( C IUR( I ,1).LE.IVCL(N,2)) .AND. ( I UR ( I , 2 ) . GT . 

/  I VCL ( N  ,  2 ) )  )  ;  VL-U 

UOLl-( ( I  UR ( 1,2) .GE. IVCL(N, 1 ) ) . AND. ( IUR( I , 1 ) . LT . 

/  I VCL ( N , 1 ) ) ) ;  U-VL 

A0L2 - ( ( IAR ( I , 1 ) . LE. IVCL(N, 2) ) .AND. ( I AR ( I , 2 ) . GT . 

/  I VCL ( N , 2 ) )  )  ;  VL-A 

IF  ( ( (AOL1 . AND. U0L2) . OR . ( A0L2 . AND . UOL 1 ) ) . AND. VOK) 

/  IVCL(N,7)-5 

105  CONTINUE 

111  CONTINUE 

C 

30  CONTINUE  j  *****  END  OF  MAIN  LOOP 

55  CONTINUE 

C 

C  ******2ND  MAIN  LOOP  FOR  SEGMENTATION  OF  VCL****** 

C 

DO  500  N- 1 ,20 

IF  ( I VCL ( N , 1 ) . EQ.O)  GOTO  550 
IF  ( IVCL(N,7) .GE. 5)  GOTO  500 
IF  ( IVCL(N, 7 ) .NE. 1 )  GOTO  21 
SEG ( I VCL ( N , 1 ) )-5 
S EG ( I VCL ( N , 2 )  )  -  5 

TYPE  N,"  VCL  MEETS  I  CONDITIONS" 

21  CONTINUE 

IF  ( I VCL(N , 7 ) . NE . 2 )  GOTO  22 
TYPE  N,"  VCL  MEETS  T  CONDITIONS" 

DO  23  J-  1 ,20 

IF  ( .NOT. ( (TBD( J, 1 ) . LT. IVCL(N, 1 ) ) . AND. (TBD( J, 2) .GT. 
/  I VCL( N , 2 ) ) ) )  GOTO  24 
SEG(TBD( J  ,  1  )  )-5 
SEG(TBD(J,2))-5 
CONTINUE 
CONTINUE 


105 


-  -  <  **. **.  'VA 


24 

23 


CONTINUE 


V2A-0 

V20-0 

V2U-0 

V2E-0 

V1A-0 

V1O-0 

V1U-0 

VlE-0 

RW2A-0 

RW20-0 

RW1  A-0 

RW1O-0 

LU  1*0 

LU2-0 


INDICATES  2ND  EDGE  VCL  NEXT  TO  A-REGION 
INDICATES  2ND  EDGE  VCL  NEXT  TO  O-REGION 
INDICATES  2ND  EDGE  VCL  NEXT  TO  U-REGION 
INDICATES  2ND  EDGE  VCL  NEXT  TO  E-REGION 
INDICATES  1ST  EDGE  VCL  NEXT  TO  A-REGION 
INDICATES  1ST  EDGE  VCL  NEXT  TO  O-REGION 
INDICATES  1ST  EDGE  VCL  NEXT  TO  U-REGION 
INDICATES  1ST  EDGE  VCL  NEXT  TO  E-REGION 
INDICATES  ROW  IN  IAR.IOR,  ETC. 


INDICATES  IF  U-REGION  IS  PART  OF  LETTER 
U  OR  Y 


DO  60  I  *■  1  ,20 

IF  < IVCL(N, 1 ) .EQ.O)  GOTO  95 

IF  < .NOT. ( ( IABS< I VCL(N , 2 )-IAR( I , 1 ) ) ) . LE. 3) )  GOTO  61 
V2A-IAR<I,3)  ;  SPECIFIC  TYPE  OF  REGION  STORED 
RW2A-I  ;  ROW  POSITION  STORED 

CONTINUE 

IF  (  .NOT. < ( IABS< IVCL(N, 2)-IOR( I , 1 ) > ) . LE . 3)  )  GOTO  62 
V20-I0R( I ,3) 

RW20-I 

CONTINUE 

IF  ( ( IABS< IVCL(N, 2)-IUR< I , 1 ) ) ) . LE. 3)  V2U-IUR(I,3) 

IF  ((V2U.EQ.9).AND.(<IABS(IVCL(N+1,1)-IUR(I,2))).LE.3) 
•AND. (IVCL(N,4) . EQ . 1 ) . AND. ( ( I VCL ( N+ 1 , 3 ) . EQ . 1 ) . OR . 

( I VCL(N+1 , 3) . EQ . 3) ) )  LU2-1  ;  U-REGION  IN  LETTER  U  OR  Y 

TYPE  N,"  LU2-",LU2 

IF  < < IABS< IVCL<N, 2 )-IER< I , 1 ) ) ) . LE . 3)  V2E-IER(I,3) 

IF  (  . NOT. ( ( IABS( I VCL(N , l )-I AR< I , 2 ) ) ) . LE  .  3  )  )  GOTO  63 
V 1  A" I AR ( I ,3) 

RW1  A«I 
CONTINUE 

IF  ( . NOT. ( < IABS< I VCL(N ,  I  )-IOR( I , 2 ) ) ) . LE. 3 ) )  GOTO  64 
V 1 O* I OR ( I ,3) 

RW10-I 

CONTINUE 

IF  ( < IABS( IVCL(N , 1 )-IUR< I , 2 ) ) ) . LE. 3)  V1U-IUR(I,3) 

IF  ((VIU.EQ.9). AND .((IABS(IVCL(N-I,2)-IUR(I,1))).LE.3) 
. AND. ( IVCL(N- 1 , 4 ) . EQ . 1 ) . AND. ( ( I VCL(N , 3) . EQ . 1 ) .OR . 
<IVCL(N,3).EQ.3))>  LU1-1  ;  U-REGION  IN  LETTER  U  OR  Y 

TYPE  N,"  LU1-",LU1 

IF  ( ( IABS( IVCL(N , 1 >-IER( I , 2) ) ) . LE. 3)  V1E-IER(I,3) 
CONTINUE 


CONTINUE 

AO«< < VI  A .GT. 0) . AND. < V20.GT. 0) )  ;  1 
0A-( ( VIO.GT.O) . AND. < V2A.GT.0) )  ;  2 
EA-(  (V1E.GT.0) . AND. < V2A.GT.0)  >  ;  3 


u.  L2- 

r 

ft  - 

K 


AE-( (V1A.GT.0) .AND. (V2E . GT . 0 ) ) 
OE-((V10.GT.O) .AND. (V2E.GT.O)  ) 
EO-C(VIE.GT.O) .AND. ( V20.GT.O) ) 
OO-((VlO.GT.0) .AND. (V20.GT.O)) 
AA-( ( V1A.GT.O) .AND. (V2A.GT.0)) 
EE-( ( V1E.GT.O) .AND. (V2E.GT.0) > 
UU»((VIU. GT .0) .AND. (V2U.GT.0) ) 
0U“((V10. GT . 0 ) . AND . (V2U.GT.0)) 
AU»((V1A. GT .0) .AND. (V2U.GT.0) ) 
EU-< ( V1E.GT.0) . AND. (V2U.GT. 0) ) 
UO»((V1U.GT.O) .AND. (V20.GT.O)) 
UA-( (V1U.GT.0) .AND. (V2A.GT.0) ) 
UE*((V1U.GT.0) .AND. ( V2E.GT.0) ) 


CHECK  IF  VCL  TOO  WIDE  TO  BE  CAUSED  BY  ONE  LETTER 

WDVC-IVCL(N , 2 )-IVCL(N , i )+l 

WD«0 

IF  ( (WDVC.GT. (MVCWM . 5 ) > . OR . ( WDVC . GT . ( TWD* 1 .25) ) .OR. 

/  (WDVC.GT.  (  IWDM  .  25  ))  )  WD«l 
CHECK  IF  PREVIOUS  VCL  TOO  WIDE  TO  BE  CAUSED  BY  ONE  LETTER 
PWD«- 1 

IF  ((N-l).LT.l)  GOTO  83 
PWDVC-IVCL(N-1 ,2)-IVCL(N-l , I )■«•  l 
PWD-0 

IF  ( (PWDVC.GT. ( MVCW* 1 . 5 ) ) . OR . ( PWDVC . GT . ( TWD* 1 . 2 5 ) ) . OR  . 
/  ( PWDVC.GT. ( IWDM  . 25)  )  )  PWD-l  ;  PREVIOUS  VCL  WIDE 

CONTINUE 

CHECK  IF  NEXT  VCL  TOO  WIDE  TO  BE  CAUSED  BY  ONE  LETTER 
NWD« - l 

IF  (IVCL( (N+l ) , 1 ) . EQ . 0 )  GOTO  84 
NWDVC-I VCL(N-U  ,2)-IVCL(N+l  ,  l >  +  I 
NWD-0 

IF  ( (NWDVC.GT. (MVCW* 1 . 5 ) ) . OR . ( NWDVC . GT . ( TWD* 1 . 25) ) .OR. 
/  (NWDVC.  GT.  (  IWDM  .  25  ))  )  NWD-1 
CONTINUE 


FOLLOWING  CODE  CHECKS 


IF  A-REGION  AFTER 


,  W,  OR  Y. 

AU2-1  ;  INDICATES  A/U  REGION  AFTER  VCL. 

AU2-0  ;  USED  TO  DISTINGUISH  V,W,Y  FROM  U'S 
IF  ((V2A.NE.9))  GOTO  80 
DO  79  I  ,20 

IF  (IUR( J , 1 ) .EQ.O)  GOTO  80 

VAU* ( ( I UR ( J ,  l ) . LE. I AR ( RW2  A , 2 ) ) . AND . ( IUR( J , 2 ) . GT . 

IAR( RW2A , 2 ) ) ) 

U9-( I  UR ( J , 3) . EQ . 9 ) 

VOK«( (NWD.EQ.O) .AND. ( ( IVCL( N+l , 3 ) . EQ . 1 ) . OR . 

( I VCL ( NT l ,3) . EQ . 3 ) ) ) 

IF  (U9.AND. (VAU.OR. (VOK.AND. ( IVCL(N+1  ,7  )  . EQ. 5) ) ) )  AU2-1 

CONTINUE 

CONTINUE 


FOLLOWING  CODE  CHECKS 


IF  A-REGION  BEFORE  VCL 


*«  *'«.  'V 

**  v. *  .•* 


TO  V,  W,  OR  V. 

UA1-1  INDICATE  U/A  REGION  BEFORE  VCL. 

UAi-0 

IF  ((ViA.NE.9))  GOTO  109 
DO  99  J*  1 ,20 

IF  ( I  UR ( J , 1 ) . EQ . 0 )  GOTO  109 

V  AU • ( ( IUR( J  ,  2  )  . LT . IAR(RW1 A ,  1 ) ) .AND . ( I UR ( J ,  1 > . GT . 

/  I AR ( RW 1 A , 1 ) ) ) 

U9-(IUR(J,3) . EQ  .  9 ) 

VOK-<  < PWD . EQ. 0) . AND. ( ( I VCL(N- 1 , 3) . EQ. 1 ) .OR. 

/  <IVCL(N-l ,3).EQ.3)>) 

IF  (U9. AND. ( VAU.OR . ( VOK. AND. ( I VCL(N-1 , 7 ) .EQ. 5) > ) )  UAl-1 

CONTINUE 

CONTINUE 


SP —  INDICATE  IF  SPACE  IS  NEAR  VARIOUS  EDGES  OF  VCL 1 S 
SPP2“((IVCL(N-1 ,6)  . GT . 5  )  . OR . ( I VCL( N- 1  , 7 ) . EQ . 1 ) ) 
SPV1«(<IVCL(N,7).EQ.3).0R.(IVCL<N,5).GT.5)) 

SPV2-( ( IVCL(N, 7 ) . EQ. 4) .OR. ( I VCL (N, 6) .GT. 5 ) ) 

SPNl-( ( I VCL ( N+ 1 ,5) .GT. 5> .OR. (IVCLCN+1 . 7) .EQ. I ) ) 


VIT«< ( IVCL(N, 7) .EQ. 2 ) .OR. 
P2BL-< IVCLIN- l , 4).EQ.3) 
P2AB-< IVCL(N-1 , 4 ) . EQ . 2 ) 
V1BL“(IVCL(N,3).EQ.3) 
V1AB-(IVCL(N,3) .EQ.2) 
V2BL-( IVCL(N , 4) .EQ. 3) 
V2AB-( IVCL(N,4) .EQ.2) 
NIBL-l IVCL(N-*-l ,3) .EQ.3) 
N1AB-(IVCL(N+1 ,3) .EQ.2) 
ABBL-((V1AB.AND.V2BL).0R. 


(IVCL(N, 7) .EQ. 1 ) ) ;  IS  I  OR  T 


(VIBL. AND. V2AB) .OR. 


(IVCL(N,3) . EQ.4) .OR. ( I VCL( N , 4 ) . EQ . 4 ) ) 


MDVC-(IVCL(N, 1 )+IVCL<N, 2) )/2 


Vt«IVCL<N, l ) 

2  « I VCL ( N , 2 ) 

)?2»I  VCL(N-  1,2) 

N1 -IVCL(N+1 , 1 ) 

IF  ( .NOT. (ABBL) )  GOTO  26 
SEG ( MDVC ) ■ 1 0 

TYPE  N,"  ABBL  CONDITION" 
CONTINUE 


POSITION  OF  1ST  EDGE  VCL 

POSITION  OF  2ND  EDGE  VCL 

POSITION  OF  2ND  EDGE  PREVIOUS  VCL 

POSITION  OF  1ST  EDGE  NEXT  VCL 

SEG.  ON  IMPOSSIBLE  FEATURE 


IF  (WD.EQ.O)  GOTO 
SEG ( MDVC ) ■ 2 
GOTO  460 
CONTINUE 


1  1  2 


IF  2  VCL  TOO  CLOSE  TO  BE  CAUSED  BY  ONE  LETTER  SEG.  MIDDLE 
IF  (N1.EQ.0)  GOTO  150 

IF  ( (N1-V2) .LE.MVCW)  SEG ( ( < V2+N 1 ) / 2 ) ) - 3 
CONTINUE 

IF  (P2.EQ.0)  GOTO  155 


IF  ((V1-P2).LE.MVCW)  SEG ( ( ( V 1 +P2 ) / 2 ) ) -3 
CONTINUE 


IF  ( ( VIT) .OR. ( IVCL(N, 7 ) .GE. 5) )  GOTO  500 
THE  FOLLOWING  CASES  NEED  NO  ADDITIONAL  SEGMENTATION 
FEATURES  OTHER  THAN  A.O.U,  AND  8  REGIONS  AND  WIDTH  OF  VCL : 

EO  CASE 

IF  ( .NOT. ( (EO) . AND. ( VI E.EQ. 9) . AND. (WD.NE. 1 ) ) )  GOTO  300 
SEG( VI )  ■  5 
TYPE  N , " EO " 

CONTINUE 

OO  CASE 

IF  ( . NOT. ( (00) .AND. ( V20 . EQ . 9) . AND. ( V 1 O . EQ . 9 ) . AND . 

/  (WD.NE. 1)))  GOTO  305 
SEG (VI )«5 
TYPE  N,"00" 

5  CONTINUE 

UO  CASE  AND  AO  CASE  (  JUST  UO  CASE  SEGMENTED  AT  MDVC  ) 
IF  ( .NOT. ( (UO) .AND. (AO) . AND . ( V20 . EQ . 9 ) . AND . ( WD . NE . 1 ) ) ) 
/  GOTO  310 
SEG( VI ) ■ 5 
TYPE  N,"U/AO" 

D  CONTINUE 

AO  CASE  AND  U  A  1  •  1 

IF  ( .NOT. ( (AO) . AND. ( .NOT. (UO) ) . AND. ( V2 O . EQ . 9 ) . AND . 

/  (UA1 .EQ. 1 ) .AND. (WD.NE. 1)))  GOTO  315 
SEG( VI )-5 
TYPE  N , " V  O " 

5  CONTINUE 

AO  CASE 

IF  ( .NOT. ( (AO) .AND. ( .NOT. ( UO ) ) .AND. ( V20 . EQ . 9 ) . AND . 

/  (WD.NE. 1)))  GOTO  320 
SEG (VI ) ■ 5 
TYPE  N , "  AO  " 

IF  ( .NOT. ( (V1AB.OR.V2AB) .AND. (NWD.EQ. 0)))  GOTO  318 
TYPE  "MISTAKE  COULD  TO  DUE  TO  SOME  T'S" 

B  CONTINUE 

3  CONTINUE 

OA  CASE 

IF  ( . NOT. ((OA). AND. ( .NOT. ( OU ) ) . AND . ( AU 2 . EQ . 0 ) .AND. 

/  ( V  10 . EQ . 9) . AND . ( WD . EQ . 0) ) )  GOTO  325 

SEG ( V 1 ) ■ 5 
TYPE  N  ,  " 0 A " 

5  CONTINUE 

OA  CASE  AND  OU  CASE 

IF  ( .NOT. ( (OA) . AND. (OU) . AND. ( V10. EQ . 9) .AND. (WD.NE . 1 ) ) ) 
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■AVW 


GOTO  330 
SEG( V2  )“5 
TYPE  N,"OA/U" 

CONTINUE 

OA  CASE  AND  AU2-1 

IF  ( .NOT. ( (OA) . AND. ( .NOT. (OU) ) . AND . ( AU 2 . EQ . 1 ) .AND. 
(WD.NE.l)))  GOTO  335 
SEG( V2)-5 
TYPE  N , " O  V " 

CONTINUE 

EA  CASE 

IF  ( .NOT. ( (EA) .AND. ( .NOT. (EU)).AND.(V1E.EQ.9) .AND. 
(WD.NE.l)))  GOTO  340 
SEC (VI )  ■  5 
TYPE  N , "EA" 

CONTINUE 

AA  CASE 

IF  ((AA) .AND. (NIT) .AND. ( WD . EQ . 1 ) )  SEG(MDVC)-5 

IF  ( .NOT. ( (AA) .AND. (UA1 .NE. I ) .AND. ( . NOT . ( U A . OR . AU ) ) .AND. 

(WD.NE.l)))  GOTO  345 

SEG(Vl  )-5 

TYPE  N , " AA" 

CONTINUE 

AA  CASE  AND  UA  CASE 

IF  ( .NOT. ( (AA) .AND. (UA) .AND. ( UA 1 .NE. I ) .AND. 

(WD.NE.l)))  GOTO  350 
SEG( VI )-5 
TYPE  N  ,  "  U / AA " 

CONTINUE 

AA  CASE  AND  UAl-1 

IF  ( .NOT. ( (AA) . AND. (UA1 .EQ. 1 ) . AND. (WD.NE. 1 ) ) )  GOTO  355 
SEG ( V 1 ) • 5 
TYPE  N,"VA" 

CONTINUE 

UA  CASE 

IF  ( .NOT. ( (UA) . AND. ( . NOT. ( A A . OR . UU ) ) . AND . ( AU2 . NE . 1 ) .AND. 
(WD.NE.l)))  GOTO  360 
SEG(V1 )-5 
TYPE  N  ,  "  U  A  " 

CONTINUE 

AG  CASE 

IF  ( .NOT. ( (AE) .AND. ( . NOT . ( UE . OR . ( UA 1 . EQ . 1 ) ) ) .AND. 
(V2E.EQ. 1 5) .AND. (WD.NE. 1 )) >  GOTO  365 
SEG( VI ) ■ 5 

TYPE  N , "AG  FEATURE" 

CONTINUE 


OG  CASE 

IF  ( .NOT. ( (OE) . AND .(V2E.EQ.15). AND. (WD.NE. 1 ) ) ) 

GOTO  370 
SEG( VI )-5 

TYPE  N,"OG  FEATURE" 

CONTINUE 

EG  CASE 

IF  ( .NOT. ( (EE) .AND. ( VIE. EQ. 1 2) .AND. ( V2E.EQ. 1 5) .AND. 
(WD.NE. 1)))  GOTO  375 
SEG( VI  )  ■  5 
TYPE  N , "EG” 

CONTINUE 

GE  CASE 

IF  ( . NOT. ((EE). AND. (VIE. EQ. 15) .AND. ( V2E. EQ. 1 2 ) .AND. 

(WD. NE. 1 ) ) )  GOTO  380 
SEG ( V  2 ) “ 5 
TYPE  N,"GE" 

CONTINUE 

GG  CASE 

IF  ( .NOT. ( (EE) .AND. ( V1E.EQ.  1 5) .AND. ( V2E.EQ. 1 5 ) .AND. 
(WD.NE. 1)))  GOTO  385 
S  EG ( MD VC ) ■ 5 
TYPE  N , "GG" 

CONTINUE 

GU  CASE 

IF  ( . NOT. ( ( EU) .AND. ( . NOT . ( EA . OR . ( AU2 . EQ . 1 ) ) ) .AND. 

( V 1 E. EQ. 1 5 ) . AND. (WD.NE . 1 ) ) )  GOTO  390 
SEG ( MDVC ) ■ 5 
TYPE  N , " G  U " 

CONTINUE 

GA  CASE  AND  GU  CASE  OR  AU2-1 

IF  ( .NOT. ( (EA) . AND. ( (AU2 . EQ. 1 ) .OR. (EU) ) .AND. ( V 1 E . EQ . 1 5 ) 
.AND. (WD.NE. 1 )) )  GOTO  395 
SEG(V2)-5 
TYPE  N,"GV" 

CONTINUE 

AE  CASE  ********* 

IF  ( . NOT. ( (EA) . AND. ( .NOT. ( UE . OR . ( UA 1 . EQ . 1 ) ) ) . AND . 

( V2E . EQ . 1 2 ) . AND . (WD .NE .  I  )  )  )  GOTO  400 
SEG ( V  2 ) ■ 5 
TYPE  N , " A E " 

IF  (P2AB.OR.P2BL)  GOTO  397 
TYPE  "POSSIBLE  MISTAKE  IF  re" 

CONTINUE 

CONTINUE 


OE  CASE 

IF  ( .NOT. ( (OE) . AND. < VIO. EQ . 9) . AND. ( V2E.EQ. 1 2) 

. AND . ( WD . NE . 1 ) ) )  GOTO  403 
SEG ( V2 ) ■ 5 
TYPE  N , " O  E " 

CONTINUE 

EE  CASE  ******** 

IF  ( .NOT. ( (EE) . AND .(V1E.EQ.12).AND.(V2E.EQ.12) 

. AND .  (  WD . NE . 1 ) ) )  GOTO  4i0 
SEG< VI )-5 

TYPE  N ,  "EE-  POSSIBLE  MISTAKE  DUE  TO  aa  or  as" 

CONTINUE 

UE  CASE 

IF  ( . NOT. < (UE) .AND. ( VI E. EQ. 1 2 ) .AND. ( . NOT. (AE.OR . 

( UA 1 . EQ .  1  )  )  )  . AND . ( WD . NE .  1 ) ) )  GOTO  415 
SEG (  V  2  )  "  5 
TYPE  N , " U  E " 

CONTINUE 

VE  CASE 

IF  ( .NOT. ( ( AE) .AND. ( ( UE ) . OR . ( UA 1 . EQ . 1 ) ) . AND . ( V2E . EQ . 1 2 ) 
.  AND. (WD.NE.  1 ) ) )  GOTO  420 
SEG( VI )-5 
TYPE  N, " V  E " 

CONTINUE 

UV  CASE 

IF  (  .  NOT. ( ( UA ) .AND. ((UU).OR.(AU2.EQ.l>> 

.AND. (WD.NE. 1 )) )  GOTO  425 
SEG ( V  2 ) ■ 5 
TYPE  N , " U  V " 

CONTINUE 

VU  CASE 

IF  ( . NOT. ( ( AU) . AND. ( ( UU ) . OR . ( U A  1  . EQ .  1  ) ) 

.AND. (WD.NE. 1)))  GOTO  430 
SEG ( V 1 ) *  5 
TYPE  N , " VU" 

CONTINUE 

OV  CASE 

IF  ( . NOT. ((OA).AND.((OU).OR.(AU2.EQ.l)) .AND. ( VIO. EQ. 9) 
•AND. (WD.NE. 1 )) )  GOTO  435 
SEG ( V  2 ) ■ 5 
TYPE  N , " 0  V " 

CONTINUE 

OU  CASE 

IF  ( . NOT. ( (OU) . AND. ( .NOT. ( OA . OR . ( AU2 . EQ . 1 ) ) ) . AND. 

(VIO. EQ . 9  ) . AND . (WD . NE  .  1 ) ) )  GOTO  440 
SEG( VI ) ■ 5 
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TYPE  N,"VE" 

)  CONTINUE 

AU  CASE 

IF  ( . NOT. < ( AU) . AND. ( . NOT.  ( (UU> . OR . ( UA ) ) ) . AND. ( VI  A . EQ. 9  ) 
/  . AND. (WD.NE. I ) ) )  GOTO  445 

SEG< VI )  ■  5 
TYPE  N , "AU" 

>  CONTINUE 

AV  CASE 

IF  ( . NOT. ( ( AA> . AND. ( (AU) . OR . ( AU2 . EQ. 1 ) ) . AND. ( VI  A . EQ. 9  ) 

/  . AND . < WD . NE . 1 ) ) )  GOTO  450 

SEG( VI ) • 5 
TYPE  N , ” A  V ” 

)  CONTINUE 

EU  CASE 

IF  ( . NOT. ((EU).AND.(.NOT.(EA.OR.(AU2.EQ.1))> .AND . 

/  ( VI E . EQ . 1 2 ) . AND . (WD. NE . I ) ) )  GOTO  455 

SEG( VI ) *  5 
TYPE  N  ,  ”  E  U  ’’ 

>  CONTINUE 

EV  CASE 

IF  ( .NOT. ( (EA) . AND. ( (EU) .OR. (AU2 . EQ . 1 )) .AND. (VIE. EQ .12) 

/  .AND. (WD.NE. 1 )) )  GOTO  460 
S  EG ( V  2 ) ■ 5 
TYPE  N,”EV" 

)  CONTINUE 

THE  FOLLOWING  RULES  DETERMINE  SEGMENTATION  LOCATION  BASED 
ON  THE  NONEXISTENCE  OF  FEATURES  IN  LETTERS  IN  THE  ENGLISH 
ALPHABET : 


IF  ((VIA. EQ . 9 
/  SEG(MDVC)-SEG 
IF  ((V1A.EQ.9 
/  SEG( VI )-SEG( V 

IF  ((V2A.EQ.9 
I  SEG(MDVC)*SEG 
IF  ((V2A.EQ.9 
/  ((VlAB.OR.V2 
/  SEG( IAR(RW2A 

IF  ((V20.EQ.9 
/  . AND . ( S  PN l ) ) 

IF  ((VIE. GE . 1 
/  SEG(MDVC)-SEG 
IF  ((VIE.GE.  1 


A-REGION  FOLLOWED  BY  LONG  VCL 
) .AND. (WD . EQ.  I ) .AND. ( (V2BL.0R. V  2  A  B )  )  ) 
(MDVC)+5 

) . AND. (WD . EQ.O) . AND. ( ( VI BL.OR. VI  AB)  )  ) 

1  )  +  5 

LONG  VCL  FOLLOWED  BY  A-REGION 
) . AND . ( WD . EQ.  I ) . AND . ( ( V 1 BL.OR . VI AB)  )  ) 
(MDVC )+5 

).  AND.  (WD. EQ.O). AND. (AU2. EQ.O). AND. 

AB)  )  ) 

,2)+3)-5  ;  ****  POSSIBLE  k  OR  h 

LONG  VCL  FOLLOWED  BY  O-REGION 
) . AND. ( WD . EQ. 1 ) .AND. ( ( V 1 BL . OR . V 1 AB ) ) 
SEG(MDVC)-SEG(MDVC)+5 

E-REGION  FOLLOWED  BY  LONG  VCL 
2) .AND. (WD.EQ.  I ) .AND. ((V2BL.OR.V2AB)  )) 
(MDVC)+5 

2).AND.(WD.EQ.O).AND.((VIBL.OR.V1AB))) 


#••*■*•*•*  •  *  «  "  \  '»  •» 
•  ■  •  »  '  •  ’.a  *  j  *  •  »  *.»  _■  /•  .*  .»  J 
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SEG( VI ) »  S  EG (VI )  +  5 


LONG  VCL  FOLLOWED  BY  E-REGION 
IF  ((V2E.GE.  12) .AND. (WD.EQ.  1 ) .AND. ((V1BL.OR.V1AB))) 
SEG(MDVC)-SEG(MDVC)+5 

IF  ((V2E.GE. 12). AND. (WD.EQ. 0) . AND . ( (V2BL.OR.V2AB) ) ) 
SEG(V2)-SEG(V2)+5 

U-REG I  ON  FOLLOWED  BY  LONG  VCL 
IF  ( (V1U.EQ.9) .AND. (WD.EQ. 1 ) .AND. ((V1BL.OR.V1AB))) 
SEG(MDVC)-SEG(MDVC)+5 

IF  (((V1A.EQ.9).0R.(V1U.EQ.9)) .AND. (WD. EQ. 0) .AND. 

( UA1 . EQ.  1  )  . AND . ( (V2BL) .OR. (V2AB)) )  SEG(Vl)-SEG(Vl)+5 

LONG  VCL  FOLLOWED  BY  U-REGION 
IF  ( (V2U.EQ.9) .AND. (WD.EQ. 1 ) .AND. ( (V1BL.0R.V1AB)  )  ) 
SEG(MDVC)-SEG(MDVC)+5 

IF  (((V2A.EQ.9).0R.(V2U.EQ.9)). AND. ( WD . EQ . 0 ) . AND . 

( AU2 . EQ.  1 ) .AND. ( ( V  2  B  L ) .OR. (V2AB)))  SEG(V2)-SEG(V2)+5 

CONTINUE 

CONTINUE 

OUTPUT  RESULTS 

DO  200  I • l , NROWS 
DO  210  J-l.NCOLS 

I ARRAY ( J ) *  1 5 

IF  ( S  EG (J).GE.I)  I ARRAY ( J ) "0 

CONTINUE 

CALL  PROW ( I BUFO .NROWS- 1+1 , 1 , NCOLS , I ARRA Y ) 

CONTINUE 


RETURN 
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